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RNA INTERFERENCE PATHWAY GENES AS TOOLS 

FOR TARGETED GENETIC INTERFERENCE 



Related Application Information 
This application claims priority from provisional application serial numbers 
60/159,776, filed October 15, 1999, and 60/193.218, filed March 30. 2000, 



Statement as to Federally Sponsored Research 
1 0 Funding for the work described herein was provided by the federal government 

(GM58800 and GM37706), which has certain rights in the invention. 

Field of the Invention 
This invention relates to the discovery of genes whose expression products are 
1 5 involved in mediation of genetic interference. 



Background of the Invention 
All eukaiyotic organisms share similar mechanisms for information transfer &om 
DNA to RNA to protein. RNA interference represents an efficient mechanism for 

20 inactivating this transfer process for a specific targeted gene. Targeting is mediated by 
the sequence of the RNA molecule introduced to the cell. Double-stranded (ds) RNA can 
induce sequence-specific inhibition of gene function (genetic interference) in several 
organisms including the nematode, C elegans(¥m,cidl., 1998, iVia/«re 391:806-811), 
plants, tiypanosomes, Drosopkila^ and planaria (Waterhouse et al., 1998, Proc, Natl 

25 Acad. Set USA 94:13959-13964; Ngo et al, 1998, Proc, Nad Acad Set. USA 95:14687- 
14692; Kennerdell and Carthew, 1998, Cell 95:1017-1026; Misquitta and Patterson, 
1999, Proc, Natl. Acad. Sci, USA 96: 1451-1456; Sanchez-Alvorado andNewmark, 1999. 
Proc, Nad. Acad. Sci. USA 96:5049-5054), The discovery that dsRNA can induce 
genetic interference in organisms from several distinct phyla suggests a conserved 

30 mechanism and perhaps a conserved physiological role for the interference process. 
Although several models of RNAi have been proposed (Baulcombe, 1999, Curr. BioL 
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9:R599-R60l; Sharp, 1999, Genes & Dev. 13:139-141) the mechanisms of action of 
specific components of the pathway arc not Icnown. 

Attempts to overexpress a gene (e.g., a transgene) often lead only to transient 
expression of the gene. Furthermore, the even more undesirable effect of 
5 "cosuppression" can occur in which a corresponding endogenous copy of the transgene 
becomes inactivated. In some cases, transgene silencing leads to problems with the 
commercial or therapeutic application of transgenic technology to alter the genetic 
makeup of a cell, organism, or human patient 

10 Summary of the Invention 

The present invention relates to the discovery of RNA interference (RNAi) 
pathway genes which are involved in mediating double-stranded RN A-dependent gene 
silencing (genetic interference). RNAi requires a set of conserved cellular fectors to 
suppress gene expression. These factors are the components of the RNAi pathway. The 

1 5 RNAi pathway mutations and genes described herein (e.g., rde-1, rde-2, rde-3, rde-4, rde- 
5, mut-2, and mut-7), and their protein products (e.g., RDE-1 and RDE-4) are useful tools 
for investigating the mechanisms involved in RNAi and developing methods of 
modulating the RNAi pathway. The sequences and methods described herein are useful 
for modulating the RNAi pathway and may be used in conjunction with other methods 

20 involving the use of genetic inhibition by dsRNA (e.g., see U.S.S.N. 09/21 5,257, filed 
December 18, 1998, incorporated herein by reference in its entirety). 

RNAi pathway components (e.g., RDE-1, RDE-4) provide activities necessary for 
interference. These activities may be absent or not sufficiently activated in many cell 
types, including those of organisms such as humans in which genetic interference may 

25 have potential therapeutic value. Components of the RNAi pathway in C. elegans may 
be sufBcient when provided through transgenesis or as direct RNA5>rotem complexes to 
activate or directly mediate genetic interference in heterologous cells tliat are deficient in 
RNAi. 

Nucleic acid sequences encoding RNAi pathway components (e.g., RDE-l, RDE- 
30 4) are useful, e.g., for studying the regulation of the RNAi pathway. Such sequences can 
also be used to generate knockout strains of animals such as C. elegans, 

2 
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The nucleic acids of the invention include nucleic acids that hybridize, e.g., under 
stringent hybridization conditions (as defined herein), to all or a portion of the nucleotide 
sequence of SEQ ID NO: I (Figure 5A-C) or its complement; SEQ ID N0:2 (Figure 6A- 
D) or its complement, or SEQ ID N0:4 or its comptement The hybridizing portion of 
5 the hybridizing nucleic acids are preferably 20, 30, 30, or 70 bases long. Preferably, the 
hybridizing portion of the hybridizing nucleic acid is 80%, more preferably 95%, or even 
98% or 100% identical to the sequence of a portion or all of a nucleic acid encoding an 
RDE-1 polypeptide or an RDE-4 polypeptide. Hybridizing nucleic acids of the type 
described above can be used as a cloning probe, a primer (e.g., a PGR primer), or a 

10 diagnostic probe. Preferred hybridizing nucleic acids encode a polypeptide having some 
or all of the biolo^cal activities possessed by a naturally*occurring RDE-1 polypeptide or 
an RDE-4 polypeptide e.g., as determined in the assays described below. 

Hybridizing nucleic acids may encode a protein that is shorter or longer than the 
RDE-1 protein or RDE-4 protein described herein. Hybridizing nucleic acids may also 

1 5 encode proteins that are related to RDE- 1 or RDE-4 (e.g., proteins encoded by genes that 
include a portion having a relatively high degree of identity to the rde-l gene or rde-4 
gene described herein). 

The invention also features purified or isolated RDE-1 polypeptides and RDE-4 
polypeptides. RDE-1 and RDE-4 polypeptides are useful for generating and testing 

20 antibodies that specifically bind to an RDE-1 or an RDE-4. Such antibodies can be used, 
e.g., for studying the RNAi pathway in C. elegans and other organisms. As used herein, 
both "protein" and "polypeptide" mean any chain of amino acids, regardless of lengdi or 
post-translational modification (e.g., glycosylation or phosphorylation). Thus, the term 
"RNAi pathway polypeptide" includes a full-length, naturally occurring RNAi pathway 

25 polypeptide such as RDE-1 protein or RDE-4 protein, as well as recombinantly or 

synthetically produced polypeptides that correspond to a full-length, naturally occurring 
RDE-1 protein, RDE-4 protein, or to particular domains or portions of a naturally 
occurring RNAi pathway protein. 

RNAi pathway mutations and strains harboring those mutations (e.g., rde*l , rde-2, 

30 rde-3, rde-4, rde-5) arc useful for studying the RNAi pathway, including identification of 
modulators of the RNAi pathway. 

3 
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RNAi pathway components (e.g., those associated with mut-7 and rde>2) can be 
used to desil^ce or prevent silencing of transgenes. To facilitate this function, such 
RNAi pathway components are inhibited using specific inhibitors of an RNAi pathway 
gene or its product. 

5 In one embodiment, the invention includes an isolated nucleic acid molecule 

comprising a nucleotide sequence encoding an RD£-l polypeptide. The nucleic acid 
molecule hybridizes under high stringency conditions to the nucleic acid sequence of 
Genbank Accession No. AFi 80730 (SEQ ID N0:2) or its complement, or the sequence 
of SEQ ID N0:1 or its complement. In one embodiment, the isolated nucleic acid can 
10 complement an rde-1 mutation. The invention also encompasses an isolated nucleic acid 
whose nucleotide sequence encodes the amino acid sequence of SEQ ID N0:3. 

The invention also encompasses a substantially pure RDE-1 polypeptide encoded 
by the isolated nucleic acids described herein. 

The invention features an antibody that specifically binds to an RDE-1 
15 polypeptide. 

The invention also includes a method of enhancing the expression of a transgene 
in a cell, the method comprising decreasing activity of the RNAi pathway. In one 
embodiment of this invention, rde-2 e^^ression or activity is decreased. 

The invention also features an isolated nucleic acid molecule comprising a 

20 nucleotide sequence encoding an RDE-4 polypeptide, wherein the nucleic acid molecule 
hybridizes under high stringency conditions to the nucleic acid sequence of SEQ ID 
N0:4 or its complement. The invention also encompasses an isolated nucleic acid 
encoding an RDE-4 polypeptide, wherein the nucleic acid can complement an rde<4 
mutation. The invention also encompasses an isolated nucleic acid encoding an RDE-4 

25 polypeptide, in which the nucleotide sequence encodes the amino acid sequence of SEQ 
IDN0:5. 

The invention also features a substantially pure RDE-4 polypeptide encoded by 
the isolated nucleic acids described herein. 

In another embodiment the invention features an antibody that specifically binds 
30 to an RDE-4 polypeptide. 

The invention also features a method of preparing an RNAi agent, the method 

4 
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includes incubating a dsRNA in the presence of an RDE-1 protein and an RDE-4 protein. 

The invention also features a method of inhibiting the activity of a gene by 
introducing an RNAi agent into a cell, such that the dsRNA component of the RNAi 
agent is targeted to the gene. In another embodiment of the invention, the cell contains 

5 an exogenous RNAi pathway sequence. The exogenous RNAi pathway sequence can be 
an RDE- 1 polypeptide or an RDE-4 polypeptide. In still another embodiment, a dsRNA 
is introduced into a cell containing an exogenous RNAi pathway sequence such as 
nucleic acid sequence expressing an RDE-1 or RDE-4. 

An RNAi pathway component is a protein or nucleic acid that is involved in 

1 0 promoting dsRN A-mediated genetic interference. A nucleic acid component can be an 
RNA or DNA molecule. A mutation in a gene encoding an RNAi pathway component 
ttidcy decrease or increase RNAi pathway activity. 

An RNAi pathway protein is a protein that is involved in promoting dsRNA 
mediated genetic interference. 

1 5 A "substantially pure DNA" is a DNA that is not immediately contiguous with 

(i.e., covalentiy linked to) both of the coding sequences with which it is immediately 
contiguous (i.e., one at the 5* end and one at the 3* end) in the naturally-occurring genome 
of the orgam'sm from which the DNA of the invention is derived. The term therefore 
includes, for example, a recombinant DNA which is incorporated into a vector, into an 

20 autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or 
eukaryote; or which exists as a separate molecule (e.g., a cDN A or a genomic or cDNA 
fragment produced by PCR (polymerase chain reaction) or restriction endonuclease 
digestion) independent of other sequences. It also includes a recombinant DNA which is 
part of a hybrid gene encoding additional polypeptide sequences. 

25 By "inhibited RNAi pathway" is meant decreased inhibitory activity of a dsRNA 

which results in at least two-fold less inhibition by a dsRNA relative to its ability to cause 
inhibition in a wild type cell Techniques for measuring RNAi pathway activity are 
described herein. The pathway can be inhibited by inhibiting a component of the 
pathway (e.g,, RDE-1) or mutating the component so that its function is reduced, 

30 A ^'substantially pure polypeptide" is a polypeptide, e.g., an RNAi pathway 

polypeptide or fragment thereof, that is at least 60%, by weight, free from the proteins 
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and naturally-occurring organic molecules with which it is naturally associated. 
Preferably, the preparation is at least 75%. more preferably at least 90%, and most 
preferably at least 99%; by weight, RNAi pathway polypeptide or fragment A 
substantially pure RNAi pathway polypeptide or-fragment thereof is obtained, for 
5 example, by extraction from a natural source; by expression of a recombinant nucleic 
acid encoding an RNAi pathway polypeptide or fragment thereof, or by chemically 
synthesiang the polypeptide or fragment Purity can be measured by any appropriate 
method, c.g., column chromatography, polyacrylamide gel electrophoresis, or HPLC 
analysis* 

1 0 By "specifically binds" is meant a molecule that binds to a particular entity, e.g., 

• an RNAi pathway polypeptide, but which does not substantially recognize or bind to 
other molecules in a sample, e.g., a biological sample, which includes the particular 
entity, e.g., RDE-1. 

An RNAi agent is a dsRNA molecule that has been treated with those components 
15 of the RNAi pathway that are required to confer RNAi activity on the dsRN A. For 

example, treatment of a dsRNA under conditions that include RDE-1 and RDE-4 results 
in an RNAi agent Injection of such an agent into an animal that is mutant for RDE-1 and 
RDE-4 will result in activation of the RNAi pathway with respect to a targeted gene. 
Typically, the dsRNA used to trigger the formation of the RNAi agent is selected to be an 
20 RNA corresponding to all or a portion of the nucleotide sequence of the targeted gene. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used m the practice or testing of the fH-esent invention, suitable 
25 methods and materials are described below. All publications, patent applications, 
patents, and other references mentioned herein are incorporated by reference. In 
addition, tiie materials, metfiods, and examples are illustrative only and not intended to be 
limiting. 

Other features and advantages of the invention will be apparent from the detailed 
30 description, and from the claims. 
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Brief Description of the Drawings 
Figure 1 A illustrates the genetic scheme used to identify rde mutants. 
Figure IB is an illustration summarizing data from the genetic mapping of rde 
and mut mutations. The vertical bars represent chromosomes; LGI, LGIII, and LGV. 
5 Reference genetic markers are indicated at the right of each chromosome and the relative 
genetic positions of the rde and mut alleles are indicated at the left. 

Figure 2A is a graphical representation of experiments investigating the 
sensitivity of rde and mut strains to RNAi by microinjection. The RNA species mdicated 
above each graph was injected at high concentration (pos-J: Img/mU par-2: 3mg/mi, sqt- 
\ 0 i: 7mg/ml). The strains receiving injection are indicated at the left and the horizontal bar 
graphs reflect the percent of progeny that exhibited genetic intwference. The Unc marker 
mutants used are also indicated. The percent embryonic lethality of Fl progeny is plotted 
as shaded bars and the fraction of affected progeny is indicated at the right of each graph. 
Figure 2B is a graphical representation of experiments demonstrating that animals 
1 5 homozygous for rde and mut alleles are resistant to RNAi targeting maternally expressed 
gcnQSy poS'l and par'2. The percent embryonic legality of Fl progeny is plotted as 
shaded bars and the fraction of affected progeny is indicated at the right of each graph. 

Figure 3 is a schematic representation of homozygous rde'l(ne219) and rde- 
4(ne299) mutant mothers receiving injections of dsRNA targeting the body muscle 
20 structural gene unc-22. 

Figure 4 A is a schematic representation of the physical map of the rde* I region. 
C. elegans YAC and cosmid DN A clones that were positive for rescue are indicated by 
an asterisk. A representation of the expanded interval showing a minimal, 25kb, rescuing 
interval defined by the overlap between cosmids T10A5 and C27H6 is shown beneath the 
25 YAC and cosmid map. Predicted genes vrithin this sequenced interval are illustrated 
above and below the hatch marked line. A single, rescuing, 4.5kb PGR fragment 
containing die KO8H10.7 predicted gene is shown enlarged. Exon and intron (box/line) 
boimdaries are shown as well as the positions of rde-I point mutation in the predicted 
coding sequences. 

30 Figure 43 is an illustration of the predicted sequence of RDE- 1 and its alignment 

with four related proteins. The sequences are RDE-1 (C. elegans; Genbank Accession 
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No. AFl 80730), F48F7.1 (C. elegans; Genbank Accession No. Z69661)> cIF2C (rabbit; 
Genbank Accession No. AF005355), ZWILLE {Arabidopsis; Genbank Accession No, 
AJ223508), and Sting (Drosophila; Genbank Accession No. AF145680). Identities with 
RDE-t aie shaded in black, and identities among the homologs are shaded in gray. 

5 Figures 5A-5C are an illustration of the genomic sequence from cosmid K08HiO 

(Genbank accession Z831 13.1; SEQ ID N0:1) correspondmg to the rde-l gene from the 
first nucleotide of 5' untranslated region to the polyadenylation site. 

Figures 6A-6D are an illustration of the cDNA sequence of rde-l (SEQ ID N0:2), 
includbg the first 20 nucleotides constituting the 5' untranslated sequence (5'UTR) and 

1 0 the predicted amino acid sequence encoded by rde*l (RDE-1 ; SEQ ID N0:3). The 
nucleotide sequence is numbered starting with the first nucleotide of the translated 
region. 

Figure 7 A is an illustration of the protocol for injection of a wld-type 
hermaphrodite with dsRN A. 

1 5 Figure 7B is an illustration of a genetic scheme demonstrating extragenic 

inheritance of RNAi. The fraction shown represents the number of RN Ai affected F2 
hermaphrodites over the total number of cross progeny scored for each genotype class. 
Phenotypically uncoordinated (Unc). 

Figures 8A-8B are illustrations of a genetic scheme to determine if the wild-type 

20 activities rde-l, rde-2, rde'4, and mut-? are sufficient in the injected animal for 
interference among the Fl self progeny (A) illustrates crosses of heterozygous 
hermaphrodites; (B) illustrates crosses using homzygous Fl progeny from heterozygous 
mothers. The fraction shown represents the number of RNAi affected animals over the 
total number of cross progeny scored for each genotype class. 

25 Figure 9A depicts experiments of a the genetic scheme to determine if the wild- 

type activities of rde-I, rde-2, rde'4, and mut-J are sufficient in the injected animal for 
interference among the Fl self progeny. The fraction shown represents the number of 
RNAi affected animals over the total number of cross progeny scored for each genotype 
class. 

30 Figure 9B depicts experiments designed to determine the requirements for rde-l, 

rde-2. rde-4, and mut-J in F2 (Fig. lOA) and Fl (Fig. lOB) interference. The fraction 
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shown represents the number of RNAi afFected animals over the total number of cross 
progeny scored for each genotype class* 

Figures lOA-lOB are a depiction of the cDNA sequence of a wild type rde'4 
nucleic acid sequence (SEQ ID N0:4) and the predicted RDE-4 amino acid sequence 
5 (SEQ ID N0:5) of C. elegans. indicates ambiguous base assignment. 

Figure 11 is a depiction of regions of homology between the predicted RDE-4 
amino acid sequence, XIRBPA (SEQ ID N0:6), HsPKR (SEQ ID N0:7), and a 
consensus sequence (SEQ ID NO:S). A predicted secondary suructure for RDE'4 is also 
shown illustrating predicted regions of a helix and P pleated sheet. 
10 Figure 12 illustrates a scheme for rescue of an rde'4. 

Detailed Desaription 
Mutations have been discovered that identify genes involved in dsRNA-mediated 
genetic interference (RNAi). RNAi pathway genes encode products involved in genetic 

15 interference and are useful for mediating or enhancing genetic interference. These genes 
encode mediators of double-stranded RNA-mediated interference. The mediators can be 
nucleic acid or protein. RNAi pathway genes are also useful for mediating specific 
processes, e.g., a gene that mediates dsRNA uptake by cells may be useful for 
transporting other RNAs into cells or for facilitating entry of agents such as drugs into 

20 cells. The methods and examples described below illustrate the identification of RNAi 
pathway components, the uses of RNAi pathway components, mutants, genes and their 
products. 

Identification of an RNAi-deficient mutants and an RNAi pathway gene. rde»l 
25 RNAi pathway genes were identified using screens for C elegans strains mutant 

for RNAi (Examples 2 and 3). The mutations were further characterized for germiine 
and somatic effects, effects on transposon mobilization, X chromosome loss and 
transgene silencing, and target tissue activity (Examples 4 and 5). 

The rde*l gene was identified using YACs (yeast artificial chromosomes) and 
30 cosmids to rescue rde-l mutants. Based on the identified sequence, a cDNA sequence 
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was identified in a C elegans cDNA libraiy and the complete cDNA sequence 
determined (Example 6). 

Identification of RNAi Pathway Genes Homologous to rde-K rdt-l. rdc-3. and rde-4 
5 RNAi pathway genes from C. elegans (such as those described herein) and from 

other organisms (e.g. plant, mammalian, especially human) are useful for the elucidation 
of die biochemical pathways involved in genetic interference and for developing the uses 
of RNAi pathway genes described herein. 

Several approaches can be used to isolate RNAi pathway genes including two* 
1 0 hybrid screens, complementation of C. elegans mutants by expression libraries of cloned 
heterologous (e,g., plant, manunalian, human) cDNAs, polymerase chain reactions (PGR) 
primed with degenerate oligonucleotides, low stringency hybridization screens of 
heterologous cDNA or genomic libraries with a C. elegans RNAi pathway gene, and 
database screens for sequences homologous to an RNAi pathway gene. Hybridization is 
1 5 performed under stringent conditions. Altematively, a labeled fragment can be used to 
scieen a genomic library derived from the organism of interest, again, using appropriately 
stringent conditions. Such stringent conditions are well known, and will vary predictably 
depending on the specific organisms from which the library and the labeled sequences are 
derived. 

20 Nucleic acid duplex or hybrid stability is expressed as the melting temperature or 

Tm, which is the temperature at which a probe dissociates from a target DNA. This 
melting temperature is used to define the required stringency conditions. If sequences are 
to be identified that are related and substantially identical to the probe, rather than 
identical, then it is useful to first establish the lowest temperature at which only 

25 homologous hybridization occurs with a particular SSC or SSPE concentration. Then 
assume that 1% mismatching results in rC decrease in the Tm and reduce the temperature 
of the final wash accordingly (for example, if sequences with > 95% identity with the 
probe are sought, decrease the final wash temperature by 5*^C). Note that this assumption 
is very approximate, and the actual change in Tb, can be between 0.5** and 1 .5°C per 1% 

30 mismatch. 
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As used herein, high stringency conditions include hybridizing at Si^'C in 5x 
SSaSx Denhardt solution/ 1.0% SDS, or in 0.5 MNaHP04 (pH 7.2)/l mM EDTA/7% 
SDS, or in 50% fonnaniide/0.25 M NaHP04 (pH 7.2V0.25 M NaCl/l mM EDTA/7% 
SDS; and washing in 0.2x SSaO.1% SDS at room temperature or at 42T, or in 0, Ix 
5 SSC/0.1% SDS at 68**C, or in 40 mM NaHPO* (pH 7.2)/l mM EDTA/5% SDS at 50^C, 
or in 40 mM NaHP04 (pH 7.2) 1 mM EDTA/1% SDS at SO^C. Moderately stringent 
conditions include washing in 3x SSC at 42*C. The paiamcters of salt concentration and 
temperature can be varied to achieve the desired level of identity between the probe and 
the target nucleic acid. 

10 For guidance regarding such conditions see, for example, Sambrook et al., 1989, 

Molecular Cloning. A Laboratory Manual, Cold Springs Harbor Press, N.Y.; and Ausubel 
et al. (eds.), 1995, Current Protocols in Molecular Biology, (John Wiley & Sons, N.Y.) at 
Umt 2.10. 

Methods of screening for and identifying homologs of C elegans RNAi genes 
1 5 (e.g., rde-1 ) are known in the art For example, complementation of mutants, described 
in the Examples can be performed using nucleic acid sequences from organisms other 
than C. elegans. Methods of inhibiting expression of a target gene in a cell using dsRN A 
are known in the art and are exemplified in U.S.S.N. 09/215,257, filed December 18, 
1998, which is incorporated herein by reference in its entirety, 
20 Another method of screening is to use an identified RNAi pathway gene sequence 

to screen a cDNA or genomic library using low stringency hybridizations. Such methods 
are known in the art. 

PCR with degenerate oligonucleotides is another method of identifying homologs 
of RNAi pathway genes (e.g., human rde-l). Homologs of an RNAi pathway gene 

25 identified in other species are compared to identify specific regions with a high degree of 
homology (as in the sequence comparison shown in Figure 4). These regions of high 
homology are selected for designing PCR primers that maximize possible base-pairing 
mth heterologous genes. Construction of such primers involves the use of 
oligonucleotide mixtures Uiat account for degeneracy in the genetic code, i.e., allow for 

30 the possible base changes in an RNAi pathway gene that does not affect the amino acid 
sequence of the RNAi pathway protein. Such primers may be used to amplify and clone 
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possible KN Ai pathway gene fragments from DNA isolated from another ozganism (e.g., 
mouse or human). The latter are sequenced and those encoding protein fragments with 
high degrees of homology to fragments of the RNAi pathway protein are used as nucleic 
acid probes in subsequent screens of genomic DNA and cDNA libraries (e.g., mouse or 
S human). Full-length genes and cDNAs having substantial homology to the previously 
identified RNAi pathway gene are identified in these screens. 

To produce an RNAi pathway gene product (e.g., RDE-1) a sequence encoding 
the gene is placed in an expression vector and the gene expressed in an appropriate cell 
type. The gene product is isolated from such cell lines using methods known to those in 
10 the art, and used in the assays and procedures described herein. The gene product can be 
a complete RNAi pathway protein (e.g., RDE-1) or a fragment of such a protein. 

Methods of Expressing RNAi Pathway Proteins 

Full-length polypeptides and polypeptides corresponding to one or more domains 

15 of a frill-length RNAi pathway protein, eg., the RN A-binding domain of RDE-4, are also 
within the scope of the invention. Also within the invendon are frision proteins in which 
a portion (e.g., one or more domains) of an RDE-1 or RDE-4) is frised to an unrelated 
protein or polypeptide Ci.e., a frision partner) to create a fusion protein. The frision 
partner can be a moiety selected to facilitate purification, detection, or solubilization, or 

20 to provide some other frmction. Fusion proteins are generally produced by expressing a 
hybrid gene in which a nucleotide sequence encoding all or a portion of of an RNAi 
pathway protein is joined in-frame to a nucleotide sequence encoding the fusion partner. 
Fusion partners include^ but are not limited to, the constant region of an immunoglobulin 
(IgFc). A fusion protein in which an RNAi pathway polypeptide is frised to IgFc can be 

25 more stable and have a longer half-life in the body than the polypeptide on its own. 

In general, RNAi pathway proteins (e.g., RDE-1, RDE-4) according to the 
invention can be produced by transformation (transfection, transduction, or infection) of a 
host cell with all or part of an RNAi pathway protein-encoding DNA fragment (e.g., one 
of the cDNAs described herein) in a suitable expression vehicle. Suitable expression 

30 vehicles include: plasmids, viral particles, and phage. For insect cells, baculovirus 
expression vectors are suitable. The entire expression vehicle, or a part thereof, can be 



12 



wo 01/29058 



PCT/lISOO/28470 



integrated into the host cell genome. In some circumstances, it is desirable to employ an 
inducible expression vector. e.g., the LACS WITCH™ Inducible Expression System 
(Stratagene; LaJolla, CA). 

Those skilled in the field of molecular biology will understand that any of a wide 
5 variety of expression systems can be used to provide the recombinant proteia The 
precise host cell used is not critical to the invention. The RNAi pathway protein can be 
produced in a prokaryodc host (e.g., E, coli or B. subtilis) or in a eukaryotic host (e.g., 
Saccharomyces or Pichia\ mammalian cells, e.g., COS. NIH 3T3 CHO, BHK, 293, or 

HeLa cells; or insect cells). 

1 0 Proteins and polypeptides can also be produced in plant cells. For plant cells viral 

expression vectors (e.g., cauliflower mosaic virus and tobacco mosaic virus) and plasmid 
expression vectors (e.g., Ti plasmid) are suitable. Such cells are available from a wide 
range of sources (e.g., the American Type Culture Collection, Rockland. MD; also, see, 
e.g., Ausubel et al.. Current Protocols in Molecular Biology, John Wiley & Sons, New 

1 5 York, 1 994). The methods of transformation or transfection and the choice of expression 
vehicle will depend on the host system selected. Transformation and transfection 
methods are described, e.g., in Ausubel ct al., supra ; expression vehicles may be chosen 
from those provided, c.g., in Cloning Vectors^ A Laboratory Manual (P.H. Pouwels et 
al., 1985. Supp. 1987). 

20 The host cells harboring the expression vehicle can be cultured in conventional 

nutrient media adapted as need for activation of a chosen gene, repression of a chosen 
gene, selection of transformants, or amplification of a chosen gene. 

One preferred expression system is the mouse 3T3 fibroblast host cell transfected 
with a pMAMneo expression vector (Clontech, Palo Alto, CA). pMAMneo provides an 

25 RSV-LTR enhancer linked to a dexamethasone-inducible MMTV-LTR promotor, an 
SV40 origin of replication which allows replication in mammalian systems, a selectable 
neomycin gene, and SV40 splicing and polyadenylation sites. DNA encoding an RNAi 
pathway protein would be inserted into the pMAMneo vector in an orientation designed 
to allow expression. The recombinant RNAi pathway protein would be isolated as 

30 described herein. Other preferable host cells that can be used in conjunction with the 
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pMAMneo expression vehicle include COS cells and CHO cells (ATCC Accession Nos. 
CRL 1 650 and CCL 61 . respectively). 

RNAi pathway polypeptides can be produced as fusion proteins. For example, the 
expression vector pUR278 (Ruther et al,, £WBO J. 2:1791, 1983), can be used to create 
5 lacZ fusion proteins. The pGEX vectors can be used to express foreign polypeptides as 
fusion proteins vdth glutathione S^transferase (GST). In general, such fusion proteins are 
soluble and can be easily purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. The pGEX vectors are 
designed to include thrombin or factor Xa protease cleavage sites so that the cloned target 
1 0 gene product can be released from the GST moiety. 

In an insect cell expression system, Autoerapha califomica nuclear polyhidrosis 
virus (AcNPV), which grows in Spodootera frugiperda cells, is used as a vector to 
express foreign genes. An RNAi pathway protein coding sequence can be cloned 
individually into non-essential regions (for example the polyhedrin gene) of the virus and 
1 5 placed under control of an AcNPV promoter, e.g., the polyhedrin promoter. Successful 
insertion of a gene encoding an RNAi pathway polypeptide or protein will result in 
inactivalion of the polyhedrin gene and production of non-occluded recombinant virus 
(i.e., virus lacking the proteinaceous coat encoded by the polyhedrin gene). These 
recombinant viruses are then used to infect spodoptera frugiperda cells in which the 
20 inserted gene is expressed (s^ e.g.. Smith et al., J. Virol. 46:584, 1983; Smith. U.S. 
Patent No. 4,215,051). 

In mammalian host cells, a number of viral*based expression systems can be 
utilized. When an adenovirus is used as an expression vector, the RNAi pathway protein 
nucleic acid sequence can be ligated to an adenovirus transcription/ translation control 
25 complex, e.g., the late promoter and tripartite leader sequence. This chimeric gene can 
then be inserted into the adenovirus genome by in vitro or in vivo recombination. 
Insertion into a non-essential region of the viral genome (e.g., region El or E3) will result 
in a recombinant virus that is viable and capable of expressing an RNAi pathway gene 
product in infected hosts feee, e.g., Logan, Proc. Natl. Acad. Sci. USA 81:3655, 1984). 
30 Specific initiation signals may be required for efficient translation of inserted 

nucleic acid sequences. These signals include the ATG initiation codon and adjacent 
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sequences. In cases where an entire native RNAi pathway protein gene or cDNA> 
including its own initiation codon and adjacent sequences, is inserted into the ^propriate 
expression vector, no additional translational control signals may be needed In other 
cases, exogenous translational control signals, including, perhaps, the ATG initiation 

5 codon, must be provided. Furthermore, the initiation codon must be in phase with the 
reading frame of the desired coding sequence to ensure translation of the entire insert. 
These exogenous translational control signals and initiation codons can be of a variety of 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the 
inclusion of appropriate transcription enhancer elements, transcription terminators 

1 0 (Bittner et al,, Methods in EnzymoL 1 53 :5 1 6, 1 987). 

RNAi pathway polypeptides can be expressed directly or as a &sion with a 
heterologous polypeptide, such as a signal sequence or other polypeptide having a 
specific cleavage site at the N-and/or C-terminus of the mature protein or polypeptide. 
Included within the scope of this invention arc RNAi pathway polypeptides with a 

1 S heterologous signal sequence. The heterologous signal sequence selected should be one 
that is recognized and processed, i.e., cleaved by a signal peptidase, by the host cell. For 
prokaryotic host cells a prokaiyotic signal sequence is selected, for example, from the 
group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable entorotoxin II leaders. 
For yeast secretion a yeast invertase, alpha factor, or acid phosphatase leaders may be 

20 selected* In mammalian cells» it is generally desirable to select a manunaiian signal 
sequences. 

A host cell may be chosen which modulates the expression of the inserted 
sequences, or modifies and processes the gene product in a specific, desired fashion. 
Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein 

25 products may be important for die function of the protein. Different host cells have 
characteristic and specific mechanisms for the post-translational processing and 
modification of proteins and gene products. Appropriate cell lines or host systems can be 
chosen to ensure the correct modification and processing of the foreign protein expressed. 
To this end, eukaryotic host cells that possess the cellular machinery for proper 

30 processing of the primary transcript, glycosylation, and phosphorylation of the gene 
product can be used. Such mammalian host cells include, but are not limited to, CHO, 
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VERO, BHK, HeLa, COS, MDCK, 293, 3T3, W08, and in particular, choroid plexus 
ceil lines. 

Alternatively, an RN Ai pathway protein can be produced by a stably-transfected 
mammalian cell line. A number of vectors suitable for stable transfection of mammalian 
5 cells are available to the public, see, e.g., Pouwels et al. feuora^ : methods for constructing 
such cell lines are also publicly available, e.g., in Ausubei et aL (supra) . In one example, 
cDNA encoding an RNAi pathway protein (e.g., RDE-1 or RDE-4) is cloned into an 
expression vector that includes the dihydrofoiate reductase (DHFR) gene. Integration of 
the plasmid and, therefore, the RNAi pathway protein-encoding gene into the host cell 

10 chromosome is selected for by including 0.01-300 pM methotrexate in the cell culture 
medium (as described in Ausubei et ai., supra) . This dominant selection can be 
accomplished in most cell types. 

Recombinant protein expression can be increased by DHFR-mediated 
amplification of the transfected gene. Methods for selecting cell lines bearing gene 

1 5 amplifications are described in Ausubei et al. (supra) : such methods generally involve 
expended culture in medium containing gradually increasing levels of methotrexate. 
DHFR-containing expression vectors commonly used for this purpose include pCVSEII* 
DHFR and pAdD26SV(A) (described in Ausubei et al., supra) . Any of the host ceUs 
described above or, preferably, a DHFR-deficient CHO cell line (e.g., CHO DHFR cells, 

20 ATCC Accession No. CRL 9096) are among the host cells preferred for DHFR selection 
of a stabiy-transfected cell line or DHFR*mediated gene amplification. 

A number of other selection systems can be used, including but not limited to the 
herpes simplex virus thymidine kinase, hypoxanthine-guanine phosphoribosyl- 
transferase, and adenine phosphoribosyltransferase genes can be employed in tk, hgprt^ or 

25 aprt cells, respectively. In addition, gpr, which confers resistance to mycophenolic acid 
(Mulligan et ah, Proc. Natl Acad ScL USA, 78:2072, 1981); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Gar^in et al., J. MoL Biol^ 150:1, 
1981); and hygro, which confers resistance to hygromycin (Santerre et al.. Gene, 30:147, 
1981), can be used. 

30 Alternatively, any fusion protein can be readily purified by utilizing an antibody 

specific for the fiision protein being expressed. For example, a system described in 
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Janknecht et al., Proc, Natl Acad Sci USA, 88:8972 (1981), allows for the ready 
purification of non-denatured fusion proteins expressed in human cell lines. In this 
system, the gene of interest is subcloned into a vaccinia recombination plasmid such that 
the gene's open reading frame is translationally fused to an amino-terminal tag consisting 

5 of six histidine residues. Extracts from cells infected with recombinant vaccinia virus are 
loaded onto Ni^^ nitriloacetic acid-agarose columns, and histidine-tagged proteins are 
selectively eluted with imidazole-containing buffers. 

Alternatively, an RNAi pathway protein or a portion thereof, can be fused to an 
immunoglobulin Fc domain. Such a fusion protein can be readily purified using a protein 

10 A column. 

Antibodies that Recognize RNAi Pathway Proteins 

Techniques for generating both monoclonal and polyclonal antibodies specific for 
a particular protein are well known. The invention also includes humanized or chimeric 

1 5 antibodies, single chain antibodies. Fab fragments, F(ab02 fragments, and molecules 
produced using a Fab expression library. 

Antibodies can be raised against a short peptide epitope of an RNAi pathway gene 
(e.g., rde-l), an epitope linked to a known immunogen to enhance immunogenicity, a 
long fragment of an RNAi pathway gene, or the intact protein. Such antibodies are useful 

20 for e.g., localizing RNAi pathway polypeptides in tissue sections or fractionated cell 
preparations, determining whether an RNAi pathway gene is expressed (e.g., after 
transfection with an RNAi padiway gene), and evaluating the expression of an RNAi 
pathway gene in disorders (e.g., genetic conditions) where the RNAi pathway may be 
affected. 

25 An isolated RNAi pathway protein (e.g., RDE-1), or a portion or fragment 

thereof, can be used as an inununogen to generate antibodies that bind to an RNAi 
pathway protein using standard techniques for polyclonal and monoclonal antibody 
preparation. The RNAi pathway immunogen can also be a mutant RNAi pathway protein 
or a fra^ent of a mutant RNAi pathway protein. A fiiil-iength RNAi pathway protein 

30 can be used or, alternatively, antigenic peptide fragments of RNAi pathway protein can 
be used as immunogens. The antigenic peptide of an RNAi pathway protein comprises at 
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least i (preferably 10, 15, 20, or 30) amino acid residues. In the case of RDE-1, these 
residues are drawn from the amino acid sequence shown in SEQ ID N0:3 and encompass 
an epitope such that an antibody raised against the peptide forms a specific immune 
complex with RDE- 1 . Preferred epitopes encompassed by the antigenic peptide are 
5 regions of the protein that are located on the surface of the protein, e.g., hydrophiiic 
regions. 

An RNAi pathway protein immunogen typically is used to prepare antibodies by 
immunizing a suitable subject (e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can contain, for example, 

1 0 recombinantly expressed RNAi pathway protein or a chemically synthesized RNAi 

polypeptide. The preparation can further mclude an adjuvant, such as Freund's complete 
or incomplete adjuvant, or similar immunostimulatoiy agent. Immunization of a suitable 
subject with an immunogenic RNAi pathway protein preparation induces a polyclonal 
anti-RNAi pathway protein antibody response. 

1 5 Polyclonal antibodies that recognize an RNAi pathway protein ("RNAi pathway 

antibodies") can be prepared as described above by immunizing a suitable subject with an 
RNAi pathway protein immunogen. The RNAi pathway antibody titer in the immunized 
subject can be monitored over time by standard techniques, such as with an enzyme- 
linked immunosorbent assay (ELISA) using immobilized RNAi pathway protein &om 

20 v^ch the immunogen was derived. If desired, the antibody molecules directed against 
the RNAi pathway protein can be isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as protein A chromatography to obtain 
the IgG fraction. At an appropriate time after immunization, e.g., when the RNAi 
pathway antibody titers are highest, antibody-producing cells can be obtained from the 

25 subject and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein ( 1 975) Nature 
256:495-497, the human B cell hybridoma technique (Kozbor et al. (1983) Immunol. 
Today 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies 
and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology 

30 for producing hybridomas is well known (see generally Current Protocols in Immunology 
(1994) Coligan et al. (eds.) John Wiley & Sons, Inc., New York, NY). Briefly, an 
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immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with an RNAi pathway immunogen as described above, and 
the culture supematants of the resulting hyMdoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds to the RNAi pathway protein. 
5 Any of the many well known protocols used for fusing lymphocytes and 

inunortalized cell lines can be applied for the purpose of generating a monoclonal 
antibody against an RNAi pathway protein (see, e.g.. Current Protocols in Immunology, 
supra; Galfre et al., 1977, Nature 266:55052; R.H. Kenneth, in Monoclonal Antibodies: 
A New Dimension In Biological Analyses^ Plenum Publishing Corp., New York, New 

10 York, 1980; and Lemer, 1981, Yale 1 Biol. Med., 54:387-402. Moreover, one in the art 
will appreciate that there are many variations of such methods which also would be 
useful. Hybridoma cells producing a monoclonal antibody of the invention are detected 
by screening the hybridoma culture supematants for antibodies that bind to the RNAi 
pathway protein, e.g., using a standard ELISA assay. 

1 5 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal RNAi pathway antibody can be identified and isolated by screening a 
recombinant combinatorial inununoglobulin library (e.g., an antibody phage display 
library) with an RNAi pathway protein to thereby isolate immunoglobulin library 
members that bind to the RNAi pathway protein. Kits for generating and screening 

20 phage display libraries are conunercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene SurfZAP™ Phage 
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screenmg antibody display library can be 
found in, for example, U.S. Patent No. 5,223.409; PCX Publication No. WO 92/18619; 

25 PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791; PCT 

Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. 
WO 92/01047; PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; 
Fuchs et al., 1991, Bio/Techtology 9:1370-1372; Hay et al., 1992, Hum. Antibod 
Hybridomas 2:^1-^5; HuseetaL, 1989, Sc/ewce 246:1275-1281; Griffiths etal., 1993, 

30 EMBOJ, 12:725-734. 
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Techniques developed for the production of "chimeric antibodies" (Morrison et 
al., Proc Natl Acad ScL USA, 81:6851, 1984; Neubergcr ct al., Nature, 312:604. 1984; 
Takeda et al., Nature, 314:452, 1984) can be used to splice the genes from a mouse 
antibody molecule of appropriate antigen specificity together with genes from a human 
5 antibody molecule of sq}propriate biological activity. A chimmc antibody is a molecule 
in which different portions are derived from different animal species^ such as those 
having a variable region derived from a murine mAb and a human immunoglobulin 
constant region. 

Alternatively, techniques described for the production of single chain antibodies 
1 0 (U.S. Patent 4,946,778; and U.S. Patents 4,946,778 and 4,704,692) can be adapted to 
produce single chain antibodies against an RNAi pathway protein or polypeptide. Single 
chain antibodies are formed by linking the heavy and light chain fragments of the Fv 
region via an amino acid bridge, resulting in a single chain polypeptide. 

Antibody fragments that recognize and bind to specific epitopes can be generated 
1 3 by known techniques. For example, such fragments can include but are not limited to 
FCabOi fragments, which can be produced by pepsin digestion of the antibody molecule, 
and Fab fragments, which can be generated by reducing the disulfide bridges of F{ab')2 
fragments. Alternatively, Fab expression libraries can be constructed (Huse et al.. 
Science, 246: 1275, 1989) to allow rapid and easy identification of monoclonal Fab 
20 fragments with the desired specificity. 

Identification of RNAi Pathwav Components 

RNAi pathway components can be identified in C. elegans and other animals 
(e.g., a mammal) using the methods described in the Examples below. Pathway 
25 components can also be identified using methods known in the art and the information 
provided herein. Such components include those involved in protein:protein and 
protein:RNA interactions. Specifically, RDE-l can be used to identify additional 
proteins and RNA molecules that bind to the RDE-1 protein and so facilitate genetic 
interference. 

30 The RNAi pathway mutant strains described herein (e.g., rde-1 , rde-2, rde-3, rde- 

4, and rdc-5; also mut-2 and mut-7) can be used in genetic screens to identify additional 
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RNAi pathway components. For example, a strain deficient for rde-i activity can be 
mutagenized and screened for the recovery of genetic interference. This type of screen 
can identify ailele-specific suppressors in other genes or second site mutations within the 
rde-1 gene that restore its activity* The resuidng-strains may define new genes that 
5 activate RNAi to overcome or bypass the rde-1 defect The mutations identified by these 
methods can be used to identify their coiresponding gene sequences. 

Two*hybrid screens can also be used to identify proteins that bind to RNAi 
pathway proteins such as RDE- 1 . Genes encoding proteins that interact with RDE-1 or 
human homologs of the C. elegans RDE-U identified using the two-hybrid method 

10 (Fields and Song.1989, Nature 340:245-246; Chien et al.. 1991, Proc. Natl Acad. ScL 
USA 88:9578-9582; Fields and Stemglanz, 1994. Trends Genet, 10:286-292; Bartcl and 
Fields, 1995, Methods EnzymoL 254:241-263). DNA encoding the RDE-1 protein is 
cloned and expressed from plasmids harboring GAL4 or iexA DNA-binding domains and 
co-transformed into cells harboring lacZ and HIS3 reporter constructs along with libraries 

15 of cDNAs that have been cloned into plasmids harboring the GAL4 activation domain. 
Libraries used for such co-transformation include those made from C elegans or a 
vertebrate embryonic cell. 

Mechanisms of Action of RNAi Pathway Components 

20 Specific cellular functions associated with the RNAi pathway include the specific 

targeting of a nucleic acid by a dsRNA, uptake of dsRNA^ transport of dsRNA, 
amplification of the dsRNA signal, and genetic interference. The mechanism of 
interference may involve translation inhibition, or interference with RNA processing. In 
addition, direct effects on the corresponding gene may contribute to interference. These 

25 mechanisms can be identified investigated using the methods described herein and 
methods known in the art. 



Methods of Screening for Molecules that Inhibit the RNAi Pathway 

The following assays are designed to identify compounds that are effective 
30 inhibitors of the RNAi pathway. Such inhibitors may act by, but are not limited to, 
binding to an RDE- 1 polypeptide (e.g., from C elegans^ mouse, or human), binding to 
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intracellular proteins that bind to an RNAi padiway component, compounds that intcrfae 
with the interaction between RNAi pathway components including between an RNAi 
pathway component and a dsRNA, and compounds that modulate the activity or 
expression of an RNAi pathway gene such as rde-1 . An inhibitor of the RNAi pathway 
5 can also be used to promote expression of a transgene. 

Assays can also be used to identify molecules that bind to RNAi pathway gene 
regulatory sequences (e.g., promoter sequences), thus modulating gene expression. See, 
e.g., Piatt, 1994,^ BioL Ghent 269:28558-28562. incorporated herein by reference in its 
entirety. 

1 0 The compounds which may be screened by the methods described herein include, 

but are not limited to, peptides and other organic compounds (e.g., peptidomimetics) that 
bind to an RNAi pathway protein (e.g.» that bind to an RD£-i), or inhibit its activity in 
any way. 

Such compounds may include, but are not limited to, peptides; for example, 

1 5 soluble peptides, including but not limited to members of random peptide libraries; (see, 
e.g.. Lam et al., 1991 , Nature 354:82-94; Houghten et al., 199\, Nature 354:84-86), and 
combinatorial chemistry-derived molecular libraries made of D-and/or L-amino acids, 
phosphopeptides (including, but not limited to, members of random or partially 
degenerate, directed phosphopeptide libraries; see e.g., Songyang et al., 1993, Cell 

20 72:767-778), and small organic or inorganic molecules. 

Organic molecules are screened to identiiy candidate molecules that af&ct 
expression of an RNAi pathway gene (e.g., rde-1), e.g., by interacting with the regulatory 
region or transcription factors of a gene. Compounds are also screened to identify those 
that affect the activity of such proteins, (e.g., by inhibiting rde-1 activity) or the activity 

25 of a molecule involved in the regulation of, for example, rde-1 . 

Computer modeling or searching technologies are used to identify compounds, or 
identify modifications of compounds that modulate the expression or activity of an RNAi 
pathway protein. For example, compounds likely to interact with the active site of a 
protein (e.g., RDE-1) are identified. The active site of an RNAi pathway protein can be 

30 identified using methods known in the art including, for example, analysis of the amino 
acid sequence of a molecule, from a study of complexes of an RNAi pathway, with its 
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native ligand (e.g., a dsRNA). Chemical or X-ray ctystallographic methods can be used 
to identiiy the active site of an RNAi pathway protein by the location of a bound ligand 
such as a dsRN A. 

The three-dimensional structure of the active site is determined. This can be done 
5 using known methods, including X-ray crystallography which may be used to determine 
a complete molecular structure. Solid or liquid phase NMR can be used to determine 
certain intra-molecuiar distances. Other methods of structural analysis can be used to 
determine partial or complete geometrical structures. Geometric structure can be 
determined with an RNAi pathway protein bound to a natural or artificial ligand which 

1 0 may provide a more accurate active site structure determination. 

Computer-based numerical modeling can also be used to predict protein structure 
(especially of the active site), or be used to complete an incomplete or insufficiently 
accurate structure. Modeling methods that may be used are, for example, parameterized 
models specific to particular biopolymers such as proteins or nucleic acids, molecular 

1 5 dynamics models based on computing molecular motions, statistical mechanics models 
based on thermal ensembles, or combined models. For most types of models, standard 
molecular force fields, representing the forces between constituent atoms and groups are 
necessary, and can be selected for the model from among the force fields known in 
physical chemistry. Information on incomplete or less accurate structures determined as 

20 above can be incorporated as constraints on the structures computed by these modeling 
methods. 

Having determined the structure of the active site of an RNAi pathway protein 
(e.g., RDE-1), either experimentally, by modeling, or by a combination of methods, 
candidate modulating compounds can be identified by searching databases containing 

25 compounds along with information on their molecular structure. The compounds 
identified in such a search are those that have structures that match the active site 
structure, fit into the active site, or interact with groups defining the active site. The 
compounds identified by the search are potential RNAi pathway modulating compounds. 
These methods may also be used to identify improved modulating compounds 

30 from an already known modulating compound or ligand. The structure of the known 
compound is modified and effects are determined using experimental and computer 
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modeling methods as described above. The altered structure may be compared to the 
active site structure of an RNAi pathway protein (e.g., an RDE-1) to determine or predict 
how a particular modification to the iigand or modulating compound will affect its 
interaction with that protein. Systematic variations in composition, such as by vaiying 

5 side groups^ can be evaluated to obtain modified modulating compounds or ligands of 
preferred specificity or activity. 

Other experimental and computer modeling methods useful to identify 
modulating compounds based on identification of the active sites of an RNAi pathway 
protein and related transduction and transcription factors will be apparent to those of skill 

1 0 in the art. 

Examples of molecular modeling systems are the QUANTA programs, e.g., 
CHARMm, MCSS/HOOK, and X-LIGAND, (Molecular Simulations, Inc., San Diego, 
CA). QUANTA analyzes the construction, graphic modeling, and analysis of molecular 
structure. CHARMm analyzes energy minimization and molecular dynamics functions. 

1 5 MCSS/HOOK characterizes the ability of an active site to bind a ligand using energetics 
calculated via CHARMm. X-LIGAND fits ligand molecules to electron density of 
protein-ligand complexes. It also allows interactive construction, modification, 
visualization, and analysis of the behavior of molecules with each other. 

Articles reviewing computer modeling of compounds interacting with specific 

20 protein can provide additional guidance. For example, see Rotivinen et al., 1 988, Acta 
Pharmaceutical Fennica 97:159-166; Ripka, New Scientist June 16, 1988 pp.54.57; 
McKinaly and Rossmann, 1989, Ann. /Jev. Pharmacol Toxicol 29:1 1 1-122; Percy and 
Davies. OSAR Quantitative Structure -Activity Relationships in Drug Design pp. 189- 
193 (Alan R. Liss, Inc., 1989); Lewis and Dean, 1989, Proc. R, Soc Lond 236:125-140, 

25 1 4 1 - 1 52; and, regarding a model receptor for nucleic acid components. Askew et al.. Am. 
J, Chem. Soc. 1 1 1 :1082-1 090. Computer programs designed to screen and depict 
chemicals are available from companies such as MSI {supra), AUelix, Inc. (Mississauga, 
Ontario, Canada), and Hypercube, Inc. (Gainesville, FL). 

These applications are largely designed for drugs specific to particular proteins; 

30 however, they can be adapted to the design of drugs specific to identified regions of DNA 
or RNA. Chemical libraries that can be used in the protocols described herein include 



24 



WO0I/290S8 



PCT/USOO/28470 



those available, e.g., from ArQuIe, Inc. (Medford, MA) and Oncogene Science, Inc. 
(Uniondale, NY). 

In addition to designing and generating compounds that alter binding, as 
described above, libraries of kno%vn compounds,- including natural products, synthetic 
5 chemicals, and biologically active materials including peptides, can be screened for 
compounds that are inhibitors or activators of the RNAi pathway components identified 
herein. 

Compounds identified by methods described above can be used, for example, for 
elaborating the biolo^cal function of RNAi path^vay gene products (e.g., an RDE-l^ and 
10 to treat genetic disorders involving an RNAi pathway protein. Assays for testing the 
effectiveness of compounds such as those described herein are further described below. 

In vitro Screening Assays for Compounds that Bind to RNAi Pathway Proteins and 
Genes 

15 Jn vitro systems can be used to identify compoimds that interact with (e.g., bind 

to) RNAi pathway proteins or genes encoding those proteins (e.g,, rde-I and its protein 
product). Such confounds are useful, for example, for modulating the activity of these 
entities, elaborating their biochemistry, treating disorders in which a decrease or increase 
in dsRNA mediated genetic interference is desired. Such compounds may also be useful 

20 to treat diseases in animals, especially humans, involving nematodes, e.g., trichinosis, 
trichuriasis, and toxocariasis. Compounds such as those described herein may also be 
useful to treat plant diseases caused by nematodes. These compounds can be used in 
screens for compounds that disrupt normal function, or may themselves disrupt normal 
function. 

25 Assays to identify compounds that bind to RNAi pathway proteins involve 

preparation of a reaction mixture of the protein and the test compound under conditions 
sufficient to allow the two components to interact and bind, thus forming a complex 
which can be removed and/or detected. 

Screening assays can be performed using a number of methods. For example, an 

30 RNAi pathway protein from an organism (e.g., RDE-1), peptide, or fusion protein can he 
immobilized onto a solid phase, reacted with the test compound, and complexes detected 

25 
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by direct or indirect labeling of the test compotuid. Alternatively^ the test compound can 
be immobilized, reacted with the RNAi pathway molecule, and the complexes detected. 
MicFottter plates may be used as the solid phase and the immobilized component 
anchored by covalent or noncovalent interactions. Non*covalent attachment may be 
5 achieved by coating the solid phase with a solution containing the molecule and drying. 
Aitematively, an antibody, for example, one specific for an RNAi pathway protein such 
as RDE-1 is used to anchor the molecule to the solid surface. Such surfaces may be 
prepared in advance of use, and stored. 

In these screening assays, die non-immobilized component is added to the coated 

1 0 surface containing the immobilized component under conditions sufficient to permit 
interaction between the two components. The imreacted components are then removed 
(e.g., by washing) under conditions such that any complexes formed will remain 
immobilized on the solid phase. The detection of the complexes may be accomplished by 
a number of methods known to those in the art. For example, the nonimmobilized 

1 5 component of the assay may be prelabeled with a radioactive or enzymatic entity and 
detected using appropriate means. If die non-immobilized entity was not prelabeled, an 
indirect method is used For example, if the non-immobilized entity ts an RDE-1, an 
antibody against the RDE-1 is used to detect the bound molecule, and a secondary, 
labeled antibody used to detect the ^tire complex. 

20 Alternatively, a reaction can be conducted in a liquid phase, the reaction products 

separated from unreacted components, and complexes detected (e.g., using an 
immobilized antibody specific for an RNAi pathway protein). 

Cell-based assays can be used to identify compounds that interact with RNAi 
pathway proteins. Cell lines that naturally express such proteins or have been genetically 

25 engineered to express such proteins (e.g., by transfection or transduction of an rde-l 
DNA) can be used For example, test compounds can be administered to cell cultures 
and the amount of mRNA derived from an RNAi pathway gene analyzed, e.g., by 
Northern analysis. An increase in the amount of RNA transcribed from such a gene 
compared to control cultures that did not contain the test compound indicates that the test 

30 compound is an inhibitor of the RNAi padiway. Similarly, the amount of a polypeptide 
encoded by an RNAi pathway gene, or the activity of such a polypeptide, can be analyzed 

26 
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in the presence and absence of a test compound. An increase in the amount or activity of 
the poIypq}tide indicates that the test compound is an inhibitor of the RNAi pathway. 

Ectopic Expression of an RNAi Pathway Gene . 
5 Ectopic expression (i.e.> expression of an RNAi pathway gene in a cell where it is 

not normally expressed or at a time when it is not nomially expressed) of a mutant RNAi 
pathway gene (i.e., an RNAi pathway gene that suppresses genetic interference) can be 
used to block or reduce endogenous interference in a host organism. This is useful, e.g., 
for enhancing transgene expression in those cases where the RNAi pathway is interfering 

10 with expression of a transgene. Another method of accomplishing this is to knockout or 
down regulate an RNAi pathway gene using methods kno^vn in the art. These methods 
are useful in both plants and animals (e.g., in an invend)rate such as a nematode, a 
mouse, or a human). 

Ectopic expression of an RNAi pathway gene, e.g., rde-l or rde'4 can also be 

1 5 used to activate the RNAi pathway. In some cases, targeting can be used to activate the 
pathway in specific cell types, e.g., tumor cells. For example, a non- viral RNAi pathway 
gene construct can be targeted in vivo to specific tissues or organs, e.g., the liver or 
muscle, in patients. Examples of delivery systems for targeting such constructs include 
receptor mediated endocytosis, liposome encapsulation (described below), or direct 

20 insertion of non- viral expression vectors. 

An example of one such method is liposome encapsulation of nucleic acid. 
Success&l in vivo gene transfer has been achieved with the injection of DNA, e.g., as a 
linear construct or a circular plasmid, encapsulated in liposomes (Ledley, Human Gene 
Therapy 6:1 129-1 144 (1995) and Farhood, et aL, Ann. NY Acad. Sci. 716:23-35 (1994)). 

25 A number of cationic liposome amphiphiles are being developed (Ledley, Human Gene 
Therapy 6:1 129-1 144 (1995); Farhood, ct aL, Ana NY Acad. Sci., 716:23-35 (1994) that 
can be used for this purpose. 

Targeted gene transfer has been shown to occur using such methods. For 
example, intratracheal administration of cationic lipid-DNA complexes was shovm to 

30 effect gene transfer and expression in the epithelial ceils lining the bronchus (Brigham, et 
al.. Am. J. Rcspir. Cell Mol. Biol. 8:209-213 (1993); and Canonico, et al., Am. J. Respir, 
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Cell MoL Biol. 10:24-29 (1994)). Expression in pulmonary tissues and the endothelium 
was reported after intravenous injection of the complexes (Brigham, et al.. Am, J. Respir. 
CellMol. Biol. 8:209-213 (1993); Zhu,ctaL, Science. 261:209-211 (1993); Stewart, ct 
al.. Human Gene Therapy 3:267-275 (1992); Nabel, et al.. Human Gene Therapy 3:649- 
5 656 (1992); and Canonico, et al., J. AppL Physiol. 77:415-419 (1994)). An expression 
cassette for an RNAi pathway sequence in linear, plasmid or viral DNA forms can be 
condensed through ionic interactions with the cationic lipid to form a particulate complex 
for in vivo delivery (Stewart, et al.. Human Gene Therapy 3:267-275 (1992)), 

Other liposome formulations, for example, proteoliposomes which contain viral 
1 0 envelope receptor proteins, i.e., virosomes, have been found to eifectively deliver genes 
into hepatocytes and kidney cells aftex direct injection (Nicolau, et al., Proc. Natl. Acad. 
Sci. USA 80:1068-1072 (1993); Kaneda, et al.. Science 243:375^378 (1989); Mannino, et 
al., Biotechniques 6:682 (1988); and Tomita, et al., Biochem. Biophys. Res. Comm. 
186:129^134 (1992))- 

1 5 Direct injecdon can also be used to administer an RNAi pathway nucleic acid 

sequence in a DNA expression vectors, e.g., into the muscle or liver, either as a solution 
or as a calcium phosphate precipitate (Wolff, et al. Science 247:1465-1468 (1990); 
Ascadi, et al.. The New Biologist 3:71-81 (1991); and Benvenisty, et al, Proc. Natl. 
Acad. Sci. USA 83:9551-9555 (1986). 

20 

Preparation of RNAi Agents 

RNAi pathway components can be used to prepare RNAi agents. Such agents are 

dsRNAs that have been treated with RNAi pathway compKsnents rendering the treated 

dsRNA capable of activity in the RNAi pathway and can be used as sequence-specific 
25 interfering agents useful for targeted genetic interference. Specifically, treating a dsRNA 

with an RDE-1 and RDE-4 is useful for making an RNAi agent. An RNAi agent can be 

produced by preincubating a dsRNA in vitro in the presence of RDE*1 and RDE-4. 

Another method of preparing an RNAi agent is to activate the RNAi pathway in a 

target cell (i.e., a cell in which it is desirable to activate the RNAi pathway such as a 
30 tumor cell) by transgenesis of an rde-1 coding sequence and an rde-4 coding sequence 

into the target cell. 

28 



wo 01/29058 



PCT/t'SOO/28470 



RNAi pathway polypeptides can be modified, e.g., to enhance their stability or 
cellular uptake, by attaching lipophilic or other helper groups to the polypeptide, by the 
fonnation of chimeras with proteins or other moieties that are taken up by cells, or by the 
use of liposomes or other techniques of drug delivery known in the art. 

In C. elegans, RNAi agents appear to spread from cell to cell, thus, active RNAi 
agents can diffuse or be actively transported from conditioned media or serum directly 
into target cells. Alternatively, RNAi agents can be injected into an organism or cell. 
They may also be incorporated into a cell using liposomes or other such methods known 
in the art 

Such methods are useful for stimulating the RNAi pathway in C. elegans cells, 
and in heterologous ceils including plants and vertebrate ceils. Such methods are useiul 
in manunalian, e.g., human cells. 

Enhanced Delivery of a Cargo Compound 

RNAi pathway components that mediate the transport of dsRNA into cells and 
tissues can be used to promote the entry of dsRNA into ceils and tissues, including 
dsRNA that is linked to another compound. The method is accomplished by linking 
dsRNA to a cargo compound (e.g., a drug or DNA molecule), e.g., by a covalent bond. 
The endogenous RNAi pathway gene expressing dsRNA transport fruiction is activated 
using methods known in the art. Alternatively, other methods can be used such as 
transfecting the target cell with the gene that affects transport thus permitting the cell or 
tissue to take up the dsDNA. 

Examples 

The invention is further described in the Examples below which describe methods 
of identifying mutations in the RNAi pathway and methods of identifying genes encoding 
components of the RNAi pathway. 

Example 1 : Strains and Alleles 

The Bristol strain N2 was used as standard wild-type strain. The marker 
mutations and deficiencies used are listed by chromosomes as follows: LGI: dpy- 
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I4(eJ88), unc-l3(e5l)\ LGffl: dp}hl7(el64), unc-32(eI89); LGV: dp)hJI(e224), unc- 
42(e270h dafJI(m87X eDfl, mDJ3, nDftl. sD/29, sDflS. mc-76(e911). The C. elegans 
strain DP 13 was used to generate hybrids for STS linkage-mapping (Williams et al., 
1 992. Generics 1 3 1 :609-624), 
5 Sensitivity to RNAi was tested in the following strains. MT3 126: mut'2(r459) 

(obtained from John Collins, Department of Biochemistry & Molecular Biology, 
University of New Hampshire, Durham, NH); dp}hl9(nl347), 1V/4\0:mvt-2(r459) sent- 
4(nl378), NL917: mut-7(pk204), SS552: mes-2(bn76) rol-}(e91)/mnCl (obtained from S. 
Strome, Biology Dept., Indiana University) , SS449: meS'3(bn88) dpy'5(e61) (from S. 
1 0 Strome, supra)\ hDp20, SS268: dpy-II(e224) meS'4(bn23) unc-76(e9JI)/nTl, SS360: 
mes'6(bn66) dpy-20(eI282)/nTJ, CB879: him-l{e879). A non-Unc mut-6 strain used was 
derived from RW7096: mut-6(st702) mC'22(stJ92::Tc\\ due to the loss of Tel insertion 
in unC'22, 

Homozygous mutants of mut'6, mes-2y i, 4^ 6 and him-l showed sensitivity to 
1 5 RNAi by injection of pos-l dsRNA. The dose of injected RNA was about 0.7mg/mL 
This dose lies within the range where reduced concentration leads to reduced interference 
effects. The results of the injection oipos-J dsRNA into these mutants (dead embryos / 
Fl progeny) were as follows: mut-6: 422/437, mes-2: 781/787, meS'3: 462/474, mes'4: 
810/814, 900/1,002, him-J: 241/248, N2 (control): 365/393. 

20 To test mutator activity, a mutant that was caused by Tc4 transposon insertion 

was used: TRl 175: unC'22(r765::Tc4y Strains TW410 and TRl 175 were gifts from Q. 
Boese and J. Collins (Department of Biochemistry & Molecular Biology, University of 
New Hampshire, Durham, NH). 

25 Example 2: RNA interference assay 

Genetic interference using RNAi administered by microinjection was perfonned 
as described in Fire et al., 1998, supra and Rocheleau et al., 1997, Cell 90:707-716, pos- 
I cDNA clone yk6 1 hi . par-2 cDN A clone yk96h7, 5qt'3 cDNA clone yk75f2 were used 
to prepare dsRN A in vitro. These cDNA clones were obtained from the C ekgans 

30 cDNA project (Y. Kohara, Gene Network Lab, National Institute of Genetics, Mishima 
411, Japan). 

30 
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Genetic interference using RNAi administered by feeding was performed as 
described in Tiramons and Fire, 1998, Nature 395:854. pos-l cDNA was cloned into a 
plasraid that contains two T7 promoter sequences arranged in head-to-head configuration. 
The plasmid was transformed into an E. coli strain, BL21(DE3X and the transfonned 

S bacteria were seeded on NGM (nematode growth medium) plates containing 60iig/ml 
ampiciUin and SO^ig^ml IPTG. The bacteria were grown overnight at room tempemture 
to induce pos-l dsRNA. Seeded plates (BL21(DE3)[dsRNA] plates) stored at 4**C 
remained effective for inducing interference for up to two weeks. To test RNAi 
sensitivity, C. elegans larvae were transferred onto BL21(DE3)[dsRNA] plates and 

10 embryonic lethality was assayed in the next generation. 

Transgenic lines expressing interfering RNA for unc-22 were engineered using a 
mixture of three plasmids: pPD[L421 8] (unc-22 antisense segment, driven by myo-3 
promoter); pPD[L42 1 8] (corresponding unc'22 sense segment, driven by myo-i 
promoter); pRF4 (semidominant transformation marker). DNA concentrations in the 

15 injected mixture were lOO^g/mi each, hijections were as described (Meilo et al.» 1991 , 
EMBOJ, 10:3959; Mello and Fire, 1995, Methods in Cell Biol 48:451-482), 

Example 3: Identification of RNAi-Deficient Mutants 

A method of screening for mutants defective in the RNAi pathway was devised 

20 that would permit the large-scale application of dsRNA to mutagenized populations. 
Feeding worms K coli which express a dsRN A, or simply soaking worms in dsRNA 
solution, are both sufficient to induce interference in C. elegans (Tmunons and Fire, 
1998, supra\ Tabara et al., 1998, Science 282:430-431). To carry out a selection, the 
feeding method was optimized to deliver interfering RNA for an ess^tial gene,po5-i. 

25 C elegans hermaphrodites that ingest bacteria expressing dsRNA conresponding to a 
segment of pos-l are themselves unaffected but produce dead embryos with the 
distinctive pos-1 embryonic lethal phenotype. 

To identify strains defective in the RNAi padiway, wild-type animals were 
mutagenized, backcrossed, and the F2 generation examined for rare individuals that were 

30 able to produce complete broods of viable progeny. Chemical mutagenesis was used to 
generate the mutations as well as spontaneous mutations arising in the mut-6 strain in 
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which Tel transposons are activated (Mori et ai., 1988, Genetics 120:397-407). To 
facilitate screens for mutations, an egg laying starting strain was used. In the absence of 
egg laying, the F3 progeny remained trapped within the mother's cuticle. Candidate 
mutants had internally hatched broods of viable embryos and were thus easily 

S distinguished from the background population of individuals filled primarily with dead 
embryos (Figure 1 A). Candidates were then re-tested for resistance to injected dsRNA. 

The genetic screen used to isolate RNAi pathway mutants was similar to one 
designed by James R. Preiss for the identification of maternal effect mutants (Kemphues 
et al., 1988, Ceil 52:31 1-320). An Egl strain^ lm-2(elS09) was mutagenized with EMS 

10 and the F2 genemtion was cultured on a bacterial lawn expressing pos-1 dsRNA. 
Mutagenized populations were then screened for rare individuals that were able to 
produce complete broods of viable progeny forming a distinctive *^bag of worms** 
phenotype. To make sure that the animals were truly resistant to RNAi, candidate strains 
were next assayed for resistance to RNAi by injection. Independent EMS induced alleles 

1 5 of rde-i were found in two separate pools of mutagenized animals at a frequency of 
approximately one allele in 2,000 to 4,000 haploid genomes. 

In addition, a search was made for spontaneous mutants using a mut*6 strain in 
which Tel transposons are activated (Mori et al., 1988). 100,000 mta-S; Un-l animals 
(Mello et al., 1994) were cultured on bacteria expressing poj-/ dsRNA. After one 

20 generation of growth, surviving animals were transferred again to plates with bacteria 
expressing the dsRNA and screened for resistant mutants. Three resulting strains were 
genetically mapped. One of these strains (ne300) mapped to LGV and failed to 
complement rde-l(ne219). Two strains ne299 and nelQl mapped to LGIII and defme the 
rde-4 complementation group. Because the screen was clonal in nature and involved 

25 rounds of enrichment it is possible that both rde'4 strains are related. 

Seven mutant strains were selected for genetic mapping. These seven mutants 
defined four complementation groups; rde-ly with three alleles, rde-4^ with two alleles, 
and rde*2^ and rde-S^ with one allele each (Figure I B). 

To map the RNAi defective mutations, the RNAi resistant phenotype was assayed 

30 either by feeding bacteria expressing pos-l dsRNA or by injection of a dsRNA mixture of 
poS'J md unc-22. The same assays were used for complementation tests. In vivo 
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expression of mC'22 dsRNA was also used for mapping of rde-L Mapping with visible 
marker mutations was performed as described in Brenner (1974, GeneticSy 77:71-94) and 
mapping with STS marker was performed as described in Williams et al. (1 992, supra). 
ne2J9, ne297 and neSOO failed to complement each other, defining the rck-J 
5 locus, rde-l mutations mapped near unc-42 V. Three factor mapping was used to locate 
rde*l(neSOO) one eighth of the distance from unc-42 in the urtc-42/dqfII interval (3/24 
Unc-non-Daf recombinants analyzed). The rde-IfneSOO) allele complemented the 
chromosomal deficiency sDf29 and failed to complement eDfJ, mDJ3, nDfil and sDfl5, 
rde'2(ne221) and rde-5(ne298) mapped near unc-JS L rde-2 complemented rde-S, rde- 

1 0 4(ne299) and (neSOI) mapped near unC'69 III and failed to complement each other. 
ne299 complemented mut'7(pk204). 

The rde'l(-^) activity is sufficient maternally or zygoticaily. To test the maternal 
sufGciency, animals heterozygous for rde'l(ne219) were injected with dsRNA targeting 
the zygotic gene, sqt'3, and self progeny were assayed for the Sqt phenotype. 100% of 

1 5 the self progeny including rde^l homozygous progeny were found to exhibit the Sqt 
phenotype. Thus, maternally provided rde-J(-^) activity is sufficient to mediate 
interference with a 2ygotic target gene. Zygotic sufiSciency was assayed by injecting 
homozygous rde-J mothers with dsRNA targeting the zygotic mC'22 gene (Figure 3). 
. Injected animals were allowed to produce self-progeny or instead were mated after 12 

20 hours to wild-type males, to produce heterozygous rde/+ cross-progeny. Each class of 
progeny was scored for the unC'22 twitching phenotype as indicated by the fraction 
shown if Figure 3 (Unc progeny/total progeny). The injected animals were then mated 
with wild-type males. Self progeny from homozygous injected mothers were unaffected, 
however, 68% of the cross progeny were Unc. This result indicates that zygoticaily 

25 provided rde-J (+) activity is also sufficient. However both maternal and zygotic rde- 
activity contribute to zygotic interference as 100% of progeny from wild-type 
injected mothers exhibit mC'22 interference (606/606). Thus, rde'l(-^) and rde-4('^) 
activities are not needed for dsRNA uptake, transport or stability. 

RNAi sensitivity of several existing C. elegans mutants was also examined. Most 

30 of these mutant strains were fully sensitive to RNAi. However, RNAi resistance was 
identified in two strains that had previously been shown to exhibit elevated levels of 



33 



wo 01/29058 



PCTAJSOO/28470 



transposon mobilization (mutator strains): miu-2 (described in Collins et al., 1987, Ncaure 
328:726-728) and mtu-? (described in Ketting et al.. Ceil, in press for release on October 
15, 1999). Another mutator strain, mut'6(st702), wzs M\y sensitive to RNAi, Since 
mutator strains continually accumulate mutations, the resistance of m«/-2 and mut-7 may 

5 have been due to the fwesence of secondary mutations. To test this possibility we 
examined the genetic linkage between the mutator and RNAi resistance phenotypes of 
mta-2 and mut-7. We found that independently outcrossed mut'2(r459) mutator strains 
TW4 1 0 and NfD 1 26 both showed resistance to RNAi. We mapped the RNAi resistance 
phenotype of mui'7(pk204) to the center of linkage group HI (Figure IB), the position 

1 0 that had been defined for the mutator activity of mut-7(pk204) by Ketting et al. (supra). 
Together, these observations indicate that the RNAi resistance phenotypes of the mu('2 
and mut-7 strains are genetically linked to their mutator activities. Animals heterozygous 
for the rde and mut alleles were generated by crossing wild-type males with Unc-Rde or 
Unc-Mut hermaphrodites. The rde and mut mutations appeared to be simple recessive 

1 5 mutations with the exception of mut-2(r459), which appeared to be weakly dominant 
(Figure 2A). 

These data demonstrate that some genes are non-essential (e.g., rde-1 and rde'4). 
This method can be used to identify additional mutations in RNAi pathway genes. 

20 Example 4: Identification of Properties of RNAi-Deficicnt Mutants 
Effects of rde mutations in germiine and somatic tissues 
Microinjection was used to assay the sensitivity of each rde strain to several 
distinct dsRNA species. The pos-J and genes are expressed in the maternal 
germiine and are required for proper embryonic development (Tabara et al., 1999, 

25 Development 126:1-1 1; Boyd et al., 1996, Development 122:3075-3084), All rde- strains 
tested (as well as mut-2 and mut'7) showed significant resistance to dsRNA targeting of 
these germline-specific genes (Figure 28), as well as to several other germiine specific 
genes tested. The rde-S data (asterisk in Figure 2B) includes a 10% non-specific 
embryonic lethality present in the rde-J strain. 

30 To examine the effect of these mutations on genetic interference of somatically 

expressed genes, cells were injected with dsRNA targeting the cuticle collagen gene sqt-i 
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and the body muscle structural gene unc-ll. sqt-i hypomorphic mutants exhibit a short, 
dumpy body shape (dpy; van der Keyl et al., 1994, Dev. Dyn. 201:86-94). unC'22 
mutations exhibit severe paralysis with a distinctive body twitching phenotype (Moerman 
et al., 1986, Proc. Natl Acad Sci. USA 83:2579-2583). rde-I, rde-S, rde'4 and mwr-i 

5 strains showed strong resistance to both sqt-3 and unC'22 dsRNA, while rde'2 and mut-? 
strains showed partial resistance. Thus rde-2 and mut-J appeared to be partially tissue- or 
gene-specific in that they were required for effective RNAi against geimline but not 
somatically expressed genes. The nfe-i, rde-3, rde'4, and otm/-2 (+) activities appeared 
to be required for interference for all genes analyzed. The rde and mut strains differ from 

1 0 one another in sensitivity to sqt-2 dsRN A. 



Effect of rde on transposon mobilization 

The effect of rde mutations on transposon mobilization was examined. Two of 
the newly identified mutants, rde'2 and rde-3 exhibited a level of transposon activation 
15 similar to that of mut"? (Table l). In contrast, transposon mobilization was not observed 
in the presence of rde- 1 or rde-4 (Table 1). 
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TABLE 1 : TRANSPQSQN MOBILIZATION AND MALE INCIDENCE IN rde AND 
mut STRAINS 



Percentage of Non^Uac Revertants 


unc-22rr765::Tc4) 


0 (0/2000) 


rde-1 (ne219); 
unc-22 (r765::Tc4) 


0 (0/4000) 


rde-2(iie221; 
unc-22 (r765::Tc4) 


0.96 (8/830) 


rde-3 (ne298); 
unc-22 (r765::Tc4) 


1.6(35/2141) 


rde-4 (ne299); 
unc.22 (r765::Tc4) 


0 (0/2885) 


mut'7 (pk204); 
unc-22 (r765::Tc4) 


1.0(40/3895) 


Percentage of Male Animals 


Wild type (n2) 


0.21 (2/934) 


rde-1 (ne219) 


0.07(1/1530) 


rde-2(ne221) 


3.2 (25/788) 


rde'3 (ne298) 


7.8 (71/912) 


rdc-4 (ne299) 


0.24 (5/2055) 



5 X'Chromosome loss 

Mutator strains (including mM/-2, mut-T) rde-2 and rde-3) exhibit a second 
phenotype: a high incidence of males reflecting an increased frequency of X* 
chromosome loss during meiosis (Collins et al., 1987, supra; Kdtting et al., supra). This 
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phenotypc was observed in rde-2 and rde-S strains, but not observed in the rde-I and rde- 
4 strains which showed a wild^type incidence of males (Table 1). 

A previously described gene-silencing process appears to act on transgenes in the 
gennliite of C elegans. Although the silencing mechanisms arc not well understood, 

5 they are known to depend on the products of the genes mes'2, i, ^ and ^ (Kelly and Fire, 
1998, Development 125:2451-2456). To examine the possibility that the RNAi and 
gerraline transgene-silencing might share common mechanistic features, we first asked if 
the mes mutants were resistant to RNAi. Wc foimd normal levels of RNA interference in 
each of these strains. We next asked if RNAi deficient strains were defective in 

10 transgene-silencing. Three strains were analyzed: mut-7Q)k204), rde-l(ne2I9) Bndrde- 
2(ne22I). 

To analyze transgene silencing in mut-? wonns, homozygous mut-7 lines carrying 
various GFP reporters transgenes were generated as follows: N2 (Bristol strain) males 
were mated to ^k204) unc'32 (el 89) hermaphrodites; cross progeny males were 

1 5 then mated to strains carrying the GFP transgenes. mut-J unc'32/^'^ cross progeny from 
these matings were cloned, and mut-J unc'32 homozygous animals carrying the 
transgenes were isolated fi-om their self-progeny. After the GFP reporter transgenes were 
introduced mto different genetic backgrounds, activation of GFP transgene expression in 
germ cells was assayed at 250C by fluorescence microscopy. The tested GFP reporter 

20 transgenes were each active in some or all somatic tissues, but had become silenced in the 
germline. The plasmids used and transgene designations are as follows: 1 ) pBK48 
which contains an in-fiame insertion of GFP into a ubiquitously expressed gene, letSSS 
(Kelly, et al., 1997. Genetics 146:227-238). ccExPD727} contains more tiian 100 copies 
of pBK48 in a high copy repetitive array that is caiiied extmchromosomally. 2) pJH3.92 

25 is an in-frame fusion of GFP with the maternal pie-l gene (M. Durm and G. Seydoux, 
Johns Hopkins University, Baltimore, MD). jhExI070 carries pJH3,92 in a low copy 
"complex" extrachromosomal array generated by the procedure of Kelly et al. (1997, 
jupra) pJKL380.4 is a fusion of GFP with the C. elegans nuclear laminin gene, /am-/, 
which is expressed in all tissues (J. Liu and A. Fire). ccln4810 carries pJKL380.4 in a 

30 complex array that has been integrated into the X chromosome by gamma irradiation 
using standard techniques. 
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The mut'7 strain was analyzed most extensively and was found to exhibit 
desilencing of three different germline transgenes tested (Table 2), The rde'2 strain 
exhibited a similar level of desilencing for a single transgene. In contrast, no transgene 
desilencing was observed in rde-I mutants (Table 2), Thus, mut-7 and which differ 
5 from rde^J in having transposon mobilization and a high incidence of X-chromosome 
loss also differ from rde-I in their ability to partially reactivate silent germline 
transgenes. 



TABLE 2 : REACTIVATION OF SILENCED TRANSGENES IN THE GERMLINE OF 
10 mut-7fpk204) 



Genotype Transgene Array Percentage of 

Germline 
Desilencing 


+/+• 


ccEx7271 


8.3 (4/48) 


mut-71+ 


ccEx7271 


14.5 (7/48) 


mut-7/mut-7 


ccEx7271 


91.0(71/78) 


+/+ 


jhExl070 


3.9 (2/51) 


mut-7/mut-7 


jhExl070 


86.5 (32/37) 


+/+ 


ccin4810 


4.3 (2/46) 


mut-7/mut-7 


ccin4810 


73.3 (33/45) 


rde-l/rde-1 


ccEx7271 


0 (0/34) 



15 Example 5: Requirement for rde-lM and rde'4f^) Activities in Target Tissue 

The rde-1 and rde-A mutants differ from other RNAi deficient strains identified 
herein in that they do not cause transposon mobilization nor do they cause chromosome 
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loss. The role of these genes in upstream events such as dsRNA uptake, transport or 
stability was examined. Such events could be required for interference induced by 
exogenous trigger RN As but might be dispensable for natural functions of RNAi. To 
evaluate diese upstream events, rde-1 and r^e-^liomozygotes were exposed to dsRNA. 
5 The next generation was scored for interference. dsRNA targeting the mc'22 gene was 
injected into the intestinal cells of homozygous rde-l and rde-4 hermaphrodites and the 
injected animals were then mated to wild-type males (Figure 3). The self-progeny for 
both strains exhibited no interference with the targeted gene. However, there was potent 
interference in the r<fe-7/+ and cross progeny (Figure 3). These observations 

10 indicated that rde-I and rde-4 mutants have intact mecham'sms for transporting the 
interference effect from the site of injection (the intestine) into the embryos of the 
injected animal and then into the tissues of the resulting progeny. The stabiUty of the 
resulting interference also appeared to be normal in rde-I and rde^ as the homozygous 
injected mothos continued to produce affected cross progeny for several days after the 

15 time of injection. 

To examine whether rde-1 and rde'4 mutants could block interference caused by 
dsRNA expressed directly in die target dssue, the muscle*specific promoter from the 
myo-i gene (Dibb et al., 1989, J. Moi BioL 205:605-613) was used to drive the 
expression of both strands of the muscle structural gene unc'22 in the body wall muscles 

20 (Moenman et al., 1986, supra\ Fire et al., 1991, Development 1 13:503-514), A mixture of 
three plasmids was injected: [myo-3 promoter:amc-22 antisense], [myo-3::unc-22 sense], 
and a marker plasmid (pRF4[rol-6(sul006g£)] [Mello et al, 1991]). Frequencies of Unc 
transgenic animals were followed in Fl and F2 generations. The 'Unc phenotype was 
weak. Wild-type animals bearing this transgene exhibit a strong twitching phenotype 

25 consistent with unc'22 interference. The twitching phenotype was strongly suppressed 
by both rde-1 and rde-4 mutants (Table 3). The mut-7 and rde-2 mutants which are both 
sensitive to unc-22(RNAi) by microinjection were also sensitive to promoter driven mc- 
22 interference in the muscle (Table 3). Taken together these findings suggest that rde- 
1(-^) and rde-4(+) activities are not necessaiy for uptake or stability of the interfering 

30 RNA and may function directly in the target tissue. 
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TABLE 3 : SENSITIVITY OF rde AND mut STRAINS TO TRANSGENE-DRIVEN 
INTERFERING RNA 



Unc Animals in Unc F2 Linesin 
Transgenic Fl Inherited Lines 


Wild type (N2) 


26/59 


10/11 


rde-1 (ne219) 


0/25 


0/3 


nie-2 (ne221) 


35/72 


14/14 


rde-3 (ne298) 


l"/38 


179 


rde-4 (ne299) 


0/51 


0/4 


mut-7 0^04) 


9/13 


3/3 



5 



Example 6: Molecular Identification of the rde-l Gene 

The rde-l gene was cloned using standard genetic maj^ing to define a physical 

genetic interval likely to contain the gene using YACs and cosmids that rescue rde-1 
1 0 mutants. These were used to identify a cloned rde-1 cDNA sequence and a cloned rde-4 

sequence. These methods can also be used to identify the genes for rde-2, rde-3, and rde- 

S using the mutant strains provided herein. 

To clone an rde-l gene, >«ast artificial chromosome clones (YACs) containing C. 

elegans DNA from this interval were used to rescue the rde- J mutant phenotype. To 
1 5 facilitate this analysis candidate rescuing YACs were co-injected with plasmids designed 

to express «nc-22(RNAi). YAC and cosmid clones that mapped near the rde-l locus 

were obtained from A. Coulson. rde-l{ne2l9) was rescued by YAC clones: Y97C12 and 

Y50B5. The two overlapping YAC clones provided rde-l rescuing activity as indicated 

by unc-22 genetic interference with characteristic body paralysis and twitching in the Fl 
20 and F2 transgenic animals. In contrast a non-overlapping YAC clone failed to rescue 

resulting in 100% non-twitching transgenic strains (Figure 4 A). 
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The rescuing activity was further localized to two overlapping cosmid clones, 
cosmid C27H6 and T10A5, and finally to a single 4.5kb genomic PCR fragment 
predicted to contain a single gene, designated K08H10.7 (SEQ ID N0:1; Figures 5A-5C) 
The K08H10.7 PCR product gave strong rescue when amplified from wild-type genomic 
5 DNA. This rescue was greatly diminished using a PCR fragment amplified from any of 
the three rde-^l alleles and was abolished by a 4 bp insertion at a unique Nhcl site in the 
rde-l coding legioa A wild-type PCR product from an adjacent gene C27H6.4, also 
failed to rescue. 

The K08H10.7 gene from each of the rde-l mutant strains was sequenced, and 
10 distinct point mutations were identified that are predicted to alter coding sequences in 
K08H10.7 (Figure 4A). Based on these findings rde-l can be identified as the K08H10.7 
gene, 

A fiilMengdi cDNA sequence was determined for rde-^l using the cDNA clones* 
yk296bl0 and yk595h5. cDNA clones for rde-l were obtained from Y. Kohara (Gene 

1 5 Network Lab, National Institute of Genetics, Mishima 411, Japan). The cDNA sequence 
of coding region and 31JTR was determined on yk296bl0 except that the sequence of 
5*UTR was determined on yk595h5. The GenBank accession number for rde-l cDNA is 
AF 180730 (SEQ ID N0:2). The rde-l cDNA sequence was used to generate a predicted 
translation product (SEQ ID N0:3), referred to as RDE-1 , consisting of 1020 amino 

20 acids. The RDE-1 sequence was used to query Genbank and identify numerous related 
genes in C elegans as well as other animals and plants. This gene family includes at 
least 23 predicted C elegans genes, several of which appear to be members of conserved 
subfemilies. Withm subfamilies, conservation extends throughout the protein and all 
family members have a carboxy-terminal region that is highly conserved (Figure 4B). 

25 Besides the genes shown in Figure 4B, other related genes include 

ARGONAUTE \{Arabidopsis), SPCC736.1 1(S. pombeX and Piwi {Drosophila), A 
portion of the N terminal region of RDE-l showed no significant similarity to any of the 
identified related genes. There are no defined fimctional motifs within this gene family, 
but members including RDE-1 are predicted to be cytoplasmic or nuclear by PSORT 

30 analysis (Nakai and Horton, 1999, Trertds Biochem. ScL 24:34-36). Furthermore, one 
family member named eIF2C has been identified as a component of a cytoplasmic 
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protein ftaction isolated from rabbit reticulocyte lysates. The RDE-l protein is most 
similar to the rabbit eIF2C. However, two other C elegans family members are far more 
similar to eIF2C than is RDE-1 (Figure 4B). RDE-1 may provide sequence-specific 
inhibition of translation initiation in response toxisRNA. 

5 The rde-I mutations appear likely to reduce or eliminate rde-i (•^) activity. Two 

rde-1 alleles ne2}9 and ne297 are predicted to cause amino acid substitutions within the 
RDE-l protein and were identified at a frequency similar to that expected for simple loss- 
of-function mutations. The rde'l(ne219) lesion alters a conserved glutamate to a lysine 
(Figure 4B). The rde'l(ne297) lesion changes a non-conserved glycine, located four 

10 residues from the end of the protein, to a glutamate (Figure 4B), The third allele, ne300, 
contains the strongest molecular lesion and is predicted to cause a premature stop codon 
prior to the most highly conserved region within the protein (Q>Ochre in Figure 4B). 
Consistent with the idea that rde-1 (neiOO) is a strong loss of function mutation, we found 
that when placed in trans to a chromosomal deficiency the resulting deficiency trans- 

1 5 heterozyotes were RNAi deficient but showed no additional phenotypes- These 

observations suggest that rde-l alleles are simple loss-of-fiinction mutations affecting a 
gene required for RNAi but that is othervwsc non-essential. 

Because of its upstream role RNA interference (see Examples 8-10 below), the 
RDE-1 protein and fragments thereof can be used to prepare dsRNA that is useful as an 

20 RNAi agent. 

Example 7: Maternal Establishment and Paternal Transmission of RNAi 

To examine whether the interference effect induced by RNAi exhibited linkage to 

the target gene (e.g., was involved in a reversible alteration of the gene or associated 
25 chromatin), a strain was constructed such that the FI males that cany the RNAi effect 

also bear a chromosomal deletion that removes the target gene (Fig. 7B). In the case of 

linkage to the target gene, the RNAi effect would be transmitted as a dominant factor. 
In experiments testing the linkage of the interfer^ce effect to the target gene, 

three different species of dsRNA (pos-1 dsRNA, mom-2 dsRNA, or sgg-1 dsRNA) were 
30 delivered into C. elegans in independent experiments. The dsRNA was delivered by 

injection through a needle inserted into the intestine. In general, dsRNA was synthesized 
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in vitro using T3 and T7 polymerases. Template DNA was removed from the RNA 
samples by DNase treatment (30 minutes at 37^C). Equal amounts of sense and antisense 
RNAs were then mixed and annealed to obtain dsRNA. dsRNA at a concentration of 1-5 
mg/ml was injected into the intestine of animals. In control experiments, mixtures of 

5 linearized template DNA piasmids used for syndiesizing RNA &iled to induce 

interference in PO^ Fl , or F2 animals ^en injected into the intestine of hermaphrodites at 
a concentration of 0.2 mg/ml. Fig. 7A illustrates this experiment. The gonad of the 
parent (PO) hermaphrodite has symmetrical anterior and posterior U-shaped arms as 
shown in Fig. 7A. Several fertilized eggs are shown in Fig. 7A, centrally located in the 

1 0 uterus. The rectangular mature oocytes are cued up in the gonad arms most proximal to 
the uterus. The embryos present in PO at the time of injection gave rise to unaffected Fl 
progeny. Oocytes in the proximal arms of the injected PO gonad inherit the RNAi effect 
but also cany a functional maternal mRNA (Fl carriers of RNAi). 

After a clearance period during which carrier and unaffected Fl progeny are 

1 5 produced, the injected PO begins to exclusively produce dead Fl embryos with the 
phenotype corresponding to the inactivation of the gene targeted by the injected RNA 
(Tabara et al. 1999, Development 126:1; C Rocheleau, 1997, Cell 90:707). Potential Fl 
and F2 carriers of the interference effect were identified within the brood of the injected 
animal. In the case of hermaphrodites, carriers were defined as '"afifected" if the animals 

20 produced at least 20% dead embryos with phenotypes corresponding to maternal loss of 
function for the targeted locus. In the case of males, carriers were defined as animals 
whose cross progeny included at least one affected F2 hermaphrodite. The total number 
of carriers identified in each generation for each of three dsKNAs injected is shown in 
Fig. 7 A as a fraction of the total number of animals assayed 

25 To examine the extragenic inheritance of RNAi» experiments were carried out 

investigating whether sperm that inherit the deletion and dierefore have no copies of the 
target locus could carry the interference effect into the F2 generation. Fl males that 
carried both pos-I (RNAi) and a chromosomal deficiency for the pos^l locus were 
generated. The chromosome carrying the deficiency for pos-1 also carried a deficiency 

30 for phenotypically uncoordinated (unc). F2 progeny of the carrier male includes two 
genotypes: phenotypically wild-type animals that inherit the (+) chromosome, and 
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phenotypically uncoordinated (Unc) progeny that inherit the mDG chromosome. In these 
experiments, the deficiency bearing sperm were just as capable as wild-type sperm of 
transferring interference to the F2 hermaphrodite progeny (Fig. 7B). Thus, the target 
locus was not needed for inheritance of the interference effect. 

5 Surprisingly, although males were sensitive to RNAi and could inherit and 

transmit RNAi acquired from their mothers, direct injections into males fiailed to caxise 
transmission of ElNAi to the F 1 for several genes tested. In an example of this type of 
protocol, wild type males were injected with targeting dsRNA: body muscle structural 
gene unc-22^ cuticle collagen gene sqt-S, maternal genes pos*I and sgg-L Males of the 

10 pes'l0::gfp strain (Seydoux, G. and Dunn, MA, 1997, Development 124:2191-2201 were 
injected with gfp dsRNA. Injected males were affected by unC'22 and gfp dsRNA to the 
same extent as injected hcraiajAroditcs. No RNAi interference was detected in Fl 
progeny or injected males (40 to 200 Fl animals scored for each RNA tested. Therefore, 
the initial transmission of RNAi to Fl progeny may involve a mechanism active only in 

1 5 hermaphrodites while subsequent transmission to the F2 progeny appears to involve a 
distinct mechanism, active in both hermaphrodites and males. The hermaphrodite- 
specific step may indicate the existence of a maternal germline process that amplifies the 
RNAi agent. These data show that extracts from the maternal germline tissues of C. 
elegans may be used in conjunction with RDE-1 and RDE-4 activity to create and to then 

20 amplify RNAi agents. 

In addition, the germline factors that amplify the RNAi agents can be identified 
by mutations tiiat result in an RNAi deficient mutant phenotype. Such factors can be 
used as additional components of an in vitro system for the efficient amplification of 
RNAi agents. 

25 

Example 8: Sufficiency of Wtld«Tvpe Activities of rde-L rde-2. mut-?. and rde'4 in 
Injected Animals for Interference Among Fl Self Progeny 

To investigate whether the activities of rde-i, rde'2, rde-4, and /ni//-7, 
respectively, are sufficient in injected hermaphrodites for interference in the Fl and F2 
30 generations, crosses were designed such that wild-type activities of these genes would be 
present in the injected animal but absent in the Fl or F2 generations. To examine 
inheritance in the Fl generation, (hermaphrodite) mothers heterozygous for each mutant 
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(PO) were injected, allowed to produce self-progeny (Fl) and the homozygous mutant 
progeny in the Fl generation were examined for genetic interference (Fig. 8A). To do 
this, the heterozygous hermaphrodites from each genotype cla5S> rde-l, rde-2, 
mia-7, dpy-J 7/+; and r</e-^, unc'69/-^ (the following alleles were used in this 
5 study: rde'J(ne300) unC'42, rde'l(ne2l9), rde-2(ne22]), rde-4(ne299), and mut- 

7(pk2040) were injected with pos-I dsRNA. In each case, two types of Fl self progeny, 
distinguished by the presence of the linked marker mutations, were scored for 
interference (Fig* 8 A). In these experiments the rde-I and rde'4 mutant Fl progeny 
exhibited robust interference, comparable to wild-type, while the and m«/-7 Fl 

1 0 progeny failed to do so. In control experiments, homozygous Fl progeny from 

heterozygous (uninjected) mothers were directly injected with pos-l dsRNA (Fig. 8B). 
Injection of dsRNA directly into the rde-I and rde'4 mutant progeny of uninjected 
heterozygous mothers friiled to result in interference. Thus, injection of dsRNA into 
heterozygous hermaphrodites resulted in an inherited interference effect that triggered 

15 gene silencing in otherwise RNAi resistant rde-I and rde'4 mutant Fl progeny while rde- 
2 and mut-? mutant Fl progeny remained resistant. 

In this experiment, the expression of n/e-i(+) and rde-4(+) in the injected animal 
was sufficient for interference in later generations. 

These data suggest that treatment of a dsRNA mth functional rde-J and rde'4 

20 gene products can produce an agent that activates the remainder of the RNAi pathway. 

Example 9: Requirements for rde- 1. rde''2. rde-4, and mut-J in Fl and F2 interference 
To examine the genetic requirements for RNAi genes in the F2 generation^ Fl 

male progeny were generated that carry the interference effect as well as one mutant copy 
25 of each respective locus; rde-J, rde'2^ and mut-l (Fig. 9A). Each of these males was then 

backcrossed with uninjected hermaphrodites homo^gous for each corresponding mutant 

(Fig. 9A). The resulting cross progeny (Fl) included 50% heterozygotes and 50% 

homozygotes that were distinguished by the presence of the linked marker mutations. 

The heterozygous siblings served as controls and in each case exhibited interference at a 
30 frequency similar to that seen in wild-type animals (Fig, 9 A), In these crosses, rde'2 and 

mwr-7 homozygous F2 progeny failed to exhibit interference, indicating that the activities 
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of these two genes are required for interference in the F2 generation. In contrast, we 
found that homozygous rde-l F2 animals exhibited wild*type levels of F2 interference 
(Fig 9A). Control rde-l homozygotes generated through identical crosses were 
completely resistant to poS'l::RNAi when challenged de novo with dsRNA in the F2 

5 generation. In these experiments, 35 rde-l homozygous aninuJs generated through 
crosses shown in Fig. 9A were tested by feeding bacteria expressing pos-l dsRNA, and 
21 similar animals were tested by direct injections of pos-I dsRNA. All animals tested 
were resistant to pos-I (RNAi). Thus, rde-l activity in the preceding generations was 
sufiicient to allow interference to occur in rde-l mutant F2 animals while the wild-type 

] 0 activities of rde-l and mut-7 were required directly in the F2 animals for interference. 

In tfiis experiment, the expression of rde-li,^) and rde-4{^) in the injected ammai 
was sufficient for interference in later generations. The wild-type activities of the rde-2 
and mut-7 genes were required for interference in ail generations asayed. Thus, rde-2 and 
mut"? might be required only downstream or might also function along with rde^l and 

15 rde-4. 

These data lend additional support to the concept that an appropriately treated 
dsRNA could be used as an RNAi agent. 



Example 10: Sufficiency of rde-l Activity to Initiate RNA Interference in Injected 
20 Animals That Lack the Wiid-Tvpe Activities of rde-2. mut-7. or rde-4 

To ask if rde-2 and mut-7 activities function along with or downstream of rde-l , 

genetic cross experiments were designed In vdiich the activities of these genes were 

present sequentially (Fig, 9B). For example, rde-l{^yj-de-2{-) animals were injected with 

pos-l dsRNA and then crossed to generate F I hermaphrodites homozygous for rde-l{-)\ 

25 rrfc-2(+). In these experiments rde-l{^) activity in the mjected animals was sufficient for 
Fl interference even when the injected animals were homozygous for rde-2 or w«^7 
mutations (Fig. lOB). In contrast, rde-li-^) activity in the injected animals was not 
sufficient when the injected animals were homozygous for rde-4 mutant (Fig. I OB). 
Thus, rde-l can act independently of rde'2 and mut-7 in the injected animal, but rde-l 

30 and rde-4 must function together. These fmdings are consistent with the model that rde-l 
and rde-4 function in the formation of the inherited interfering agent (i.e., an RNAi 
agent) while rde-2 and mut-7 function at a later step necessary for interference. 
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In summary, the above Examples provide genetic evidence for the fomialion and 
transmission of extragenic interfering agents in the C elegans germline. Two C. elegans 
genes, rde^l^ and rde-4, appear to be necessary for the formation of these extragenic 
agents but not for interference mediated by thenu In contrast^ the activities of two other 
5 genes, and mut-7, are required only downstream for interference. 

These examples provide evidence that the rde-l and rde'4 gene products or their 
homologs (e.g., from a mammal) can be used to prepare agents effective in activating the 
RNAi pathway. 

10 Example 1 1 : rde-4 Sequences 

An rde'4 gene was cloned using methods similar to those described in Example 6. 
The nucleic acid sequence (SEQ ID N0:4)and predicted amino acid sequence (SEQ ID 
N0:5) are illustrated in Fig. 10. 

Analysis of the rde'4 nucleic acid sequence shows that it encodes a protein (RDE- 

1 5 4) with similarities to dsRKA binding proteins. Examples of the homology to XIRBPA 
(SEQ ID N0:6; Swissprot: locus TRBP__XENLA, accession Q91836; Eckmann and 
Jantsch, 1997, J. Cell Biol. 138:239-253) and HSPKR (SEQ ID N0:7; AAF13156.1; Xu 
and Williams, 1998, J. Into-feron Cytokine Res. 18:609-61 6), and a consensus sequence 
(SEQ ID N0:8) are shown in Fig. 1 1. Three regions have been identified within the 

20 predicted RDE-4 protein corresponding to conserved regions foxmd in all members of this 
dsRNA binding domain family. These regions appear to be important for proper folding 
of the dsRNA binding domain. Conserved amino acid residues, important for 
interactions with the backbone of the dsRNA helix, are found in ail members of the 
piotem family including RDE-4 (see consensus residues in Figure 1 1). This motif is 

25 thought to provide for general non-sequence-specific interactions with dsRNA. The 
RDE-4 protein contains conserved protein folds that are thought to be important for the 
assembly of the dsRNA binding domain in this family of proteins. Conserved amino acid 
residues in RDE-4 are identical to those that form contacts uith the dsRNA in the crystal 
structure of the XIRBP dsRNA complex. These findings strongly suggest that RDE-4 is 

30 likely to have dsRNA binding activity. 

Because RDE-4 contains a motif that is likely to bind in a general fashion to any 
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dsRNA and because RDE-4 appears to function upstream in the generation of RNAi 
agents, the RDE-4 protein or fragments thereof can be used to convert any dsRNA 
into an RNAi agent. In addition to the dsRNA bmding domain, RDE-4 contains other 
functional domains that may mediate the formation of RNAi agents. These domains may 
5 provide for interaction between RDE-4 and RDE-1 or for binding to enzymes such as 
nucleases that convert the dsRN A into the RNAi a^ent. Because of its RNA binding 
function in RNA interference, the RDE-4 protein and fragments thereof can be used to 
prepare dsRNA that is useful in preparing an RNAi agent. 

10 Example 12: Identification of Regions of RDE-1 and RDE-4 that are Required for 
Creating an RNAi Agent 

In vivo and in vitro assays are used to identify regions in RDE- 1 and RDE-4 that 
are important for the generation of RNAi agents. In the in vivo assay, rde-l and rde-4 are 

1 5 introduced into the corresponding C. elegans mutant strains via transgenes (Tabara et ai.. 
Cell 99:123 (1999); and Example 13). Important functional domains in RDE-1 and RDE- 
4 are defined by systematically altering the proteins followed by reintroduction into 
mutant animals to test for rescue of the RNAi deficient phenotype. A series of nested 
deletions are analyzed for rescue activity for both rde-l and rde-4. Specific point 

20 mutations are used to analyze the importance of specific amino acids. Chimera's are 
produced between RDE proteins and related proteins and genes. For example, coding 
sequences from RDE-1 homologs from the worm or from human are tested for their 
ability to rescue rde-l mutants. Replacing the RDE-4 dsRNA binding motif with a 
distinct RNA bindmg motif, e.g., one that recognizes a specific viral dsRNA sequence or 

25 a ssRN A sequence will alter the specificity of the RNAi response perhaps causing 

sequence-specific or ssRNA-induced gene targeting. In one form of the in vitro assay, 
whole protein extracts from rde-l or rde-4 deficient worm strains are used. 

Recombinant RDE-1 or RDE-4 is then added back to reconstitute the extract. 
Altered RDE-l and RDE-4 proteins (described above, including deletions, point mutants 

30 and chimeras) are made in vitro and then tested for their ability to function when added 
back to these extracts. RNAi activity is analyzed by injecting the reconstituted extracts 
directly into animals or by assaying for the destruction of an added in vitro synthesized 
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target mRNA. 

Example 13: Rescue of rde*4 Animals 

Rescue of animals (e.g., C. elegans) that are mutant for an RNAi pathway is a 

5 useful method for identifying sequences from RNAi pathway genes that encode 
fimctionai polypeptides, e.g., polypeptides that can eliminate the mutant phenotype. 

An example of such a method for identifying rde-4 mutant animals is as follows. 
PCR using primers located 1 kb upstream and 500 nucleotides downstream of the open 
reading frame (T20G5.1 1 ; illustrated in Fig. 12) are used to amplify the rde-4 gene j&om 

10 C. elegans genomic DNA. The resulting PCR product is then injected along with reporter 
constructs described in Tabara et al. (Cell 99:123 (1999); incorporated herein in its 
entirety by reference), and the progeny of the injected animal are assayed for rescue of 
the KN Ai deficient phenotype. The PCR product can also be cloned into a plasmid 
vector for site directed mutational analysis of RDE-4 (sec Example 12). Co-injection of 

1 5 such a wild type RDE-4 plasmid and altered derivatives can be used to identify functional 
domains of rde-4. Similar methods can be used to identify functional domains of rde-1 
and other RNAi pathway components. 

Other Embodiments 

20 It is to be understood that while the invention has been described in conjuiKtion 

with the detailed description thereof, the foregoing description is intended to illustrate 
and not limit the scope of the invention, which is defined by the scope of the appended 
claims. Other aspects, advantages, and modifications are within the scope of the 
following claims. 

25 
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1 1 . An isolated nucleic acid molecule comprising a nucleotide sequence encoding 

2 an RDE- 1 polypeptide, wherein the nucleic acid molecule hybridizes under high 

3 stringency conditions to the nucleic acid sequence of Genbank Accession No, AFl 80730 

4 (SEQ ID N0:2) or its complement, or nucleic acid sequence set forth in SEQ ID NO: 1 or 

5 its complement. 

1 2. The isolated nucleic acid of claim 1 , wherein the nucleic acid can complement 

2 an rde- 1 mutation. 

1 3. An isolated nucleic acid of claim 1, wherein the nucleotide sequence encodes 

2 the amino acid sequence of SEQ ID N0:3. 

1 4. A substantially pure RDE-i polypeptide encoded by the isolated nucleic acid 

2 of claim 1. 

1 5. Anantibody that specifically binds to an RDE- 1 polypeptide. 

1 6. A method of enhancing the expression of a transgene in a cell, the method 

2 comprising decreasing activity of the RNAi pathway. 

I 7, The method of claim 6, wherein rde-2 expression or activity is decreased. 

1 8. An isolated nucleic acid molecule comprising a nucleotide sequence encoding 

2 an RDE-4 polypeptide, wherein the nucleic acid molecule hybridizes under high 

3 stringency conditions to the nucleic acid sequence of SEQ ID N0:4 or its complement. 

1 9. The isolated nucleic acid of claim 8, wherein the nucleic acid can complement 

2 an rde-4 mutation. 

1 1 0. An isolated nucleic acid of claim 8, wherein the nucleotide sequence encodes 

2 the amino acid sequence of SEQ ID NO:S. 

1 11 . A substantially pure RDE-4 polypeptide encoded by the isolated nucleic acid 

2 of claim 8. 

1 1 2. An antibody that specifically binds to an RDE-4 polypeptide. 

1 13. A method of preparing an RNAi agent, the method comprising incubating a 

2 dsRNA in the presence of an RDE-1 protein and an RDE4 protein. 
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14. A method of inhibiting the activity of a gene, the method comprising 
introducing an RNAi agent into a cell, wherein the dsRNA component of the RNAi agent 
is targeted to the gene. 

15. The method of claim 1 4, wherein the cell contains exogenous RNAi 
sequences. 

16. The method of claim 14, wherein die exogenous RNAi sequence is an RDE-1 
polypeptide or an RDE-4 polypeptide. 



51 



wo 01/29058 



PCT/USOO/28470 



1/21 




Mutagenize 
Egl strain 



PO 



F2 



Select on E. coli expression pos-1 dsRNA 

/ \ 




Candidate rde mutants 
(viable progeny) 



Non-mutants 

(Bag of dead embryos) 



FIG. 1A 



LGl 



LGil 



LGV 



1 m.u.T 



rde-2(ne221) 
nle-3(ne29ep' 



-dpy-17 



mut-7(pk204) 

rde-4(ne29^, 
jdpy-U ne301) 
-unc-13 



\ 



-unc-32 



rde-1(ne219, 
ne297, ~ 
ne300} 



-dpy-11 
-unc-42 



FIG. 1B 



wo 01/29058 



PCT/USOO/28470 



2/21 



CO CM 

(N TJ- O 

lO CO (O 

UD SS Cvj 

CM O 

»0 CO CO 



o 



in 

CM 



< 

CM 

CD 




I 



I 



CM 
I 



5 




CO 
CM 

in 
in 

CM 

in 



(A 



f 

CO 

o 
a 



o o 
CO CO 

o o 



GO 
CO 

<o 

CO 
CO 



CD 
CM 

d 



CM CO in 
CO CO m 

CM CO CO 

CO CM ^ LL 

? §. 

E 

0) 




(0 



(D 



CM 
CM 

>S. 

CM 
f 

"2 



OO 
CJ) 
CM 
Q) 

CO 

i 



o> O^ 
CJ> lo 



CM 



Cm 

5 



o 

CM 



wo 01/29058 



PCTAJSOO/28470 



4/21 




wo 01/29058 



PCTAJSOO/28470 



5/21 



a. a 




€0 H I 

« I t . 
M U a »4 Cm 

flC < a o« H 
H u K m 

fiC M (0 O 
2 pi H |J M 

01 « 5£pQr. 

oi ^^ fn ^ca 



M I 



I 

CQ 



1 I I 



i 



CQ 



a H 0) 




01 



n m vD 
o n CM 00 

C* CM CM H 



r-l 

M 

r-t r* U l> 

I b» <M C 

W flt> fM M •'^ 

a ^ M > 

&» O N CO 



lU 



fOl 




t CO 


1 


o 




: % 


Z 


H 




t X 


1 


1 




1 M 


1 


1 




t O 


1 


1 




» C3 


1 


» 




» < 


I 


t 




i 01 


1 


) 




1 X 


1 


1 




» < 


1 


t 






1 


i 




» o 


1 


I 




» < 


1 






















ci n 




CM 


CM 










• 




ca 








•J 




« &« CM 


.4 


c: 


(4 CD 


b 


H 




Q ^ 


M 






(>« 


0 




o» 



O 

. t« :c fd »4 
»j O o) OS p« 




VO CJ\ o 

<n o CM 
m ^ CM 



o 

t li« CM 

H a> Ctt 



g WW < o 

H CO X K 
!0 »4 




o 




CM 


in 






in 






r- 


o 


CM 




o 


in 




cn 


n 






r> 




cn 
















» 






« 




» 












O 




Oi 


»4 


c 


1 


fa 


N 




C 


H 


■H 


u 


CO 


bt 


H 


•H 


de 


4J 






M 


St 


i> 


M 


CO 




b 




M 


CO 



wo 01/29058 



PCT/USOO/28470 



6/21 



\0> . 

X Q 
M 

ICQ 



H 



CO 
> « 



W CJ LO 03 



M X ^ 
O I i I 





CO 2 01 < cu 




H 

tn 





bl t 1 O 

^4 I I CO 

M I i > 

Q t i ^ 

H I t O 

I I I >^ 

1 t I cw 

< 1 I o 

U I I o 

PC I i 

O i I 

X >i 




'J) tr> 'A in 

mt 

SiiS 

M M H )-( . 

to 00 V) to 



a a & 






fH CI CM 0< 
r4 OD <K ^ CD 
tf> lA 4n in ^ 



^ (D O CM 

o tf» ^ M m 

VD ^ \0 m 



^ lO U> fH rl* 

o» N n o> CM 
u> u> 



«-i r«- a iJ o> 

f b* C4 ,J) C 

U OD bl M -H 

Q ^ M ^ 

flC fiu O N QO 



1^ O cn 

I bl (M »J C 

b] (D bl M 

Q ^ M ^ 4J 

b O N CO 











• 




bl 






O 


•4 




1 b4 






c 


a CD 


bi 




•H 


o ^ 


H 






(t Cm 


9 




CO 



wo 01/29058 



PCT/liSOO/28470 



7/21 



tt a K ot 






9i O 



odOQQ 



qQOQQ 

ooooq 

UjUJUJLiJUJ 






J J J 



o CM n u> I*" 
CD cv o r~ CO 
r- r- v£> r« iO 



r4 u O 

t bi oi i-l C 

Ed CD Cm H -H 

Q ^ M ^ 4J 

« {ik O M M 



O 00 CT» <*1 ^ 

u> r» tn u) 
CD a> vo 00 



w 

H r- O ^4 o» 

I [II c 
H CD bi H 
O H 9E 4J 

« D* o N m 



»4 

Gl Oi H » >« 

Kt o ae o 14 




O W 
\0 <*> ^ W fM 

<jv o\ o\ a> 



fH u 

M O) tu M 
^ M * 



•4 



0» 

c 
4i 



CO 
I 

OQ 

u. 



6( 9 N CQ 



wo 01/29058 



PCTAJSOO/28470 



8/21 



2 r 



on 



OA 

DO 
U 



u 
u 

Oi) 

u 



C3 
O 

01) 



en 

OD 

OA 

C8 

CQ 

CQ 

a 
a 

DA 

OtO 



OX) 
OD 



00 

u 



00 
CI 

o 



oil 

n 
u 
u 



CI 

rt 

00 

00 

CJ 

0£ 

00 

« 



w) 2 00 2 ^ 

2 00 



00 

C3 

C3 

u 

00 

00 

c3 

c« 

DO 
00 



C3 

n 
00 

00 

C3 
C« 

00 

es 

oo i* 



CI 

00 

00 

t: 

u 

A 
C3 

cs 

CI 

00 
00 

u 

C9 
00 

u 



00 
« 

00 
w 

« 
« 

00 
00 

« 

CI 
00 

n 

00 



00 

cs 

00 

C9 

CI 

Cl 

CI 

et 

00 

ru 
to 

01) 

u 

w 

00 
00 

C3 

es 

CI 
Cl 
C« 



C3 

e« 

00 

C9 

00 

u 

00 



€9 
U 
CS 

u 



" e« w 

t: 01 

t2 *-* 



cl 
u 

00 

tt 

00 
00 



« < 

S LO 



ii C S 



00 

CI 

o 



00 



C9 
C5 
00 

u 
u 
u 



C9 
CJ 
00 

u 
u 

oo 

a 

Cl 
C5 
Cl 
00 

oo 

Cl 

rt 
u 

ct 

u 

00 

Cl 



Cl 
Cl 



Cl 
00 



00 
Cl 

Cl 
♦«* 
Cl 



Cl 

00 

C9 

Cl 

Cl 

00 

Cl 

Cl 

oo 

CQ 
00 



Cl 

oo 

Cl Cl 

ei 



Cl ii oo 01 

^ w 



00 
Cl 
Cl 

00 

00 

Cl 

Cl 

Cl 

« 

« 

cs 

Cl 
Cl 
00 

00 

d 
oo 



Cl 

Cl 



d 

00 

Cl 

Cl 

oo 
u 



00 

Cl 
««• 

00 

Cl 

Cl 

« 

00 
Cl 
00 
CO 



cs 

Cl 
00 

Cl 



Cl 

d 

Cl 
Cl 

rt 

Cl 

00 
Cl 



Cl 

u 

&0 

Cl 
u 

Cl 
Cl 
Cl 

c« 

Cl 

oo 

Cl 



00 
00 

Cl 
Cl 

oo 



00 

Cl 
Cl 
Cl 

Cl 

00 

00 
00 
Cl 
00 

Cl 
Cl 
00 

00 




Cl 
C3 
Cl 
Cl 
Cl 

oo 

00 
u 



rj w 



u *- 



(J 

Cl 

cl 

Cl 
Cl 

00 

cx 

CJ 
Cl 
Cl 

00 

u 
00 



00 
Cl 

00 

cl 

cl 

Cl 

oo 



Cl 
Cl 

ei 

00 
00 

cl 
00 

00 

Cl 
Cl 

n 



w CO 



Cl 
00 

00 
Cl 

Cl 

Cl 

Cl 

00 

00 

00 



00 

Cl 
Cl 

00 

Cl 
Cl 

u 

Cl 

Cl 
Cl 

00 



Cl 



Cl 

00 

Cl 

(1 

00 
Cl 

oo 

Cl 

a 
oo 

Cl 

« 

00 
Cl 

2? 

oo 

00 

o 



cl 
n 

Cl 



00 
00 

00 

Cl 
Cl 

rt 

Cl 

Cl 

rt 

Cl 

u 

w 

00 

Cl 
CJ 

d 
d 

00 

d 
00 

00 



u 
d 

d 
d 

00 

d 
d 

d 
u 
d 

w 

d 



d 
d 
d 
d 
d 
d 
d 

00 

d 

00 

00 



d 

00 

00 

• * 



CJ 

u 

*-* 

<w 

d 
d 
d 
<j 
d 

00 

d 

00 
00 

d 
d 

00 
o 

oo 

d 

d 

00 

d 
d 

00 

u 

d 

oo 

d 

d 

d 

d 

d 

DO 

oo 

CJ 
CJ 



u 

CJ 

00 

CJ 

00 

d 

CJ 

00 

CJ 

00 

d 
d 
d 

00 

d 

u 

CJ 

Cl 

d 

00 

d 



d 
d 
o 
d 

00 

d 
d 



d 
d 
d 



d 
o 

CJ 

d 
d 
u 

CJ 
CJ 

d 

CJ 



DO 



d 
d 

DO 

d 

CJ 



o 
00 

d 



d 
d 

d 
d 

d 
d 

d 
d 



d w cl 



wo 01/29058 



PCT/USOO/28470 



9/21 

tctgcgagttcctgaatcgmcacgatccnaacagactcgaacaatcattagaagtagcaccaagaatcgaagcatggt 

ctggaatttacattggaaccaaagaattgttcgatggtgaacctgtgctcaattttgcaagtaagtttgagaaactgcga 

taaaaaatcatgtgatttttgttgaagttgtcgataaactattctacaatgcaccgaaaatgtctcttctggaitatctt 

ctcctaattgtcgacccccagtcgtgtaacgatgatgtacgaaaagatcttaaaacaaaactgatggcgggaaaaatgac 

aatcagacaagccgcgcggccaagaattcgacaattactggaaaamgaagctgaaatgcgcagaagtttgggataac 

aaatgttagtttaaattattcaaacaattaatatacaaattgattttcaggtcgagattgacagaacgacatctgacatt 

cctagatttgtgcgaggaaaactctcttgtttataaagtcactggtaaatcggacagaggaagaaatgcaaaaaagtacg 

atactacattgttcaaaatctatgaggaaaacaaaaagttcattgagtttccccacctaccactagtcaaagttaaaagt 

ggagcaaaagaatacgctgtaccaatggaacatcttgaagttcatgagaagccacaaagatacaagaatcgaattgatc 

ggtgatgcaagacaagtttctaaagcgagctacacgaaaaccTcacgaccacaaagaaaataccctaaaaatgctgaaa 

aattggatttctcttccgaagagctaaattttgttgaaagatttggattatgctccaaacttcagatgatcgaatgtcca 

ggaaaggttttgaaagagccaatgcttgtgaatagtgtaaatgaacaaattaaaatgacaccagtgattcgtggatttca 

agaaaaacaattgaatgtggttcccgaaaaagaactttgctgtgctgtttttgtagtcaacgaaacagcgggaaatccat 

gcttagaagagaacgacgttgtgtaagtgttttctacgugattattccgaaatattttcagtaagttctacaccgaact 

aattggtggttgcaagttccgtggaatacgaattggtgccaatgaaaacagaggagcgcaatctattatgtacgacgcga 

cgaaaaatgaatatgccgtaagtttcagaaaattgaaagtttttaaatatcatatttacagttctacaaaaattgtacac 

(aaataccggaatcggtagatttgaaatagccgcaacagaagcgaagaatatgtttgaacgtcttcccgataaagaaca 

aaagccttaatgttcattatcatttccaaacgacaactgaatgcttacggttttgtgaaacattattgcgaccacaccac 

cggtgtagctaatcagcatattacttctgaaacagtcacaaaagctttggcatcactaaggcacgagaaaggatcaaaac 

gaattttctatcaaattgcattgaaaatcaacgcgaaattaggaggtattaaccaggagcttgactggtcagaaattgca 

gaaatatcaccagaagaaaaagaaagacggaaaacaatgccattaactatgtatgttggaattgatgtaactcatccaa 

ctcctacagtggaattgactattctatagcggctgtagtagcgagtatcaatccaggtggaactatctatcgaaatatga 

ttgtgactcaagaagaatgtcgtcccggtgagcgtgcagtggctcatggacgggaaagaacagatattttggaagcaaa 

ttcgtgaaattgctcagagaattcgcagaagtgagttgtcttgagtatttaaaagatctctgggaittttaatttttttg 

FIG. 5B 



wo 01/29058 



PCTAJSOO/28470 



10/21 



OA 

eS 

W) 

c« 

66 

W> 

u 

c« 

OA 
o 

w 
w 

OA 

CQ 

00 

DO 

09 

00 

CI 

00 

u 

2 

u 

00 

cs 

00 



69 
00 

00 

C3 
U 
(.» 
CI 

u 

00 

c« 

00 



e9 

00 

C9 
U 

00 
CI 
C9 
00 

00 
00 

00 

00 
00 
u 

« 

00 
00 

u 
00 

cs 
u 

w 

u 

n 

00 

a 

es 

00 

u 

00 

rt 

C3 



00 
00 

u 
u 

DO 

n 

CI 

00 

« 

CO 
CO 

00 

a 
n 

CO 
■w 

CO 
00 

o 
u 

w 

a 
o 

« 

00 

rj 

n 

00 

n 



on u H 

s s ? 



C8 

w 

00 
u 
a 
ct 

e« 
n 
00 

a 



CI 



00 

u 



CO 



00 

ca 
eq 
e« 

00 
€« 

n 

a 

00 

2 

00 
00 

00 
00 

(t 

00 
00 
00 

u 
cl 

ct 

r 

00 

u 

00 

00 

<-> 
00 

£? 

C3 
OA 

rt 
a 

CI 
00 

OA 



00 

CI 
CI 

w 

ci 

CI 

ei 
00 



u 

00 

CI 

op 
c« 

00 
00 

a 
a 

et 

00 
u 

00 



n- 



00 



DO 



OA 
CI 

00 

u 
« 



& 

oo 
u 
u 

t: 

00 

u 

00 

c 

OA 

u 



00 

a 
u 

00 



o 

00 

00 
00 

Ci 



^ 00 



<9 



00 Of 

tj w TT 



OO 

€9 

00 

OA 
CS 

u 
•*■« 

00 
c« 

OO 
«•« 

00 



e9 
cs 
a 

w 
C4 
00 
00 

00 

u 

d 
cs 

a 
cs 

00 
00 
o 

C3 

cs 
a 

cs 
es 

00 



cs 
u 

CO 

S- 

CS 
00 

cs 
cs 

CO 

u 

CA 



^ 00 



CI 

u 
cs 

OA 

CS 

CS 

cs 

n 
cs 
u 



cs 
cs 

cs 
cs 

00 
00 

cs 
u 

00 



cs 
cs 

OA 

OA 

u 

00 

00 
00 



00 

OA 

cs 



ct 

OA 

OA 
C8 

OA 

cs 

00 

cs 

"S 

es 

OA 



OA 

OA 

cs 
cs 



CI 
CI 

u 
u 
cs 
es 
cs 
c« 

00 

u 

00 

cs 

00 

cs 

00 



o 
00 

CI 

d 

00 
OA 

u 
cs 

cs 

OA 



ci cs 

cs ^ 

00 u 

00 y 

cs ^ 

cs 2 

u ^ 

5 

IS cs 

cs 00 

00 s 

O CI 

t: « 

cs ■»-» 



u 
00 
cs 
00 

cs 
cs 
cs 

CO 
u 

00 

cs 

CI 

cs 
n 
00 




OA w 



CI 

cs 

00 
00 

cJ 
es 
cs 
d 

OA 

Ci 
cs 
cs 



cs 
cs 



cs 

OO 
00 



00 

cs 

CI 

ct 
00 

CJ 



cs 
cs 
cs 
cs 

OA 

cs 
o 



OA 



00 

ct 



cs 

CI 

u 
OA 



cs 

CJ 



cs 
w 
d 
d 

00 
00 

d 
d 
d 

00 

CI 

d 

CI 

d 

00 

CI 

u 
es 

00 
00 

o 
d 

CI 



cs 
es 

00 



d 
00 

d 

00 



d 
d 



d 
a 
ct 
u 

d 
d 



OA 
u 

CI 

o 

00 
CI 



a 

LU 



d 
cs 
d 

00 

u 
d 

d 
d 
d 

d 



OO 
CI 

d 
d 

cS 
d 

00 



00 



00 

d 
d 
d 
a 



d 

CI 

u 

d 
c^ 



% 



wo 0V29m 



PCT/USOO/28470 



11/21 

CAGCCACAAAGTGATG.-jy^C- =' U7R 

l/I 31/11 

.~.TG . CC TCo j*-Ai • 'vCC GaA •Tj GAA AAA iiZuA «T* ;»">T C'.^l ^AT ^.s. G-"*.t •_C'«3 G/iG 

Het ser sez ssr» c-ne cro ziu lau riu Ivs cly z'ne cyr arg his ser lau aso oro ciu 

61/21 91/31 

ATG AAA 7GG CTT GCG AGG CCC ACT 3GT AAA TGC 3AC GGC AAA TTC TAT GAG AAG AAA 6TA 
mei: lys crp iau =la ars zzo ztz ^iy iys cys ssp giy lys ?he cyr clu lys lya vai 

121/n 151/SI 

CTT CTT TTG GTA AJ^T TGG TTC AAG TTC TCC AGC AAA ATT TAC GAT CGG GAA TAG TAG GAG 
leu leu leu vai asn crp phe lys phe ser ser iys ile cyr asp arg glu cyr cyr glu 

131/61 211/Tl 

TAT GAA GTG AJ^A ATG ACA AAG C-AA GTA TTG AAT AGA AAA CCA GGA AAA CCT TTC CCA AAA 
cyr alu val lys -,Bt chr lys ciu vai leu asn arg lys pro giy lys pro phe pro iys 

241/81 271/91 

AAG ACA GAA ATT CCA ATT CCC GAT CGT GCA AAA CTC TTC TGG CAA CAT CTT CGG CAT GAG 
lys chr glu ile pro ile pro asp arg aia lys leu phe crp gin his leu arg his glu 

301/101 331/111 

AAG AAG CAG ACA GAT TTT ATT CTC GAA GAC TAT GTT TTT GAT GAA AAG GAC ACT GTT TAT 
lys lys gin chr asp phe ile leu ciu asp cyr val phe asp glu lys »sp chr val cyr 

361/12X 391/131 

AGT GTT TGT CGA CTG A^C ACT GTC ACA TCA AAA ATG CT5 GTT TCG GAG AAA GTA GTA AAA 
ser vai cys arg leu asn chr vai chr ser lys rvec leu vai ser glu lys vai val lys 

421.141 451/151 

j-.-svy vrn i •-•j Urtv> ----n ^.-v* ^nt\ .--nu . . o o.-.o ^ .cv^ .-.Cn r\l\j .-.-.-i 

lys asp ser ciu lys Ivs asp ciu lys 3sp leu ciu lys lys ile l^u cyr chr met ile 
481/161 511/171 

--.CC .-ni ,--AA ^riL — ^ rvrtC - - i r-.'-j i. '^on orLr\ .--^i o/irt .--rtA U/\V- -^/v* 

leu Chr ryr arg lys lys phe his leu asn phe ser arg glu asn pro glu lys asp glu 
c41/i31 57l/i?l 

GAA GCG .^T CGG AGT TAC .-AA TTC CTG AJ\G P^-AT GTT ATG ACC TAG r^J, GTT :GC TAC GCG 

ciu aid asn arc ser zyr lys phe leu lys asn val ^^ec chr gin lys val erg cyr aia 

501/201 631/211 

CCT TTT GTG AJ^C GAG C-AG ATT .-AA GTA CAA TTC GCG AAA AJ^T TTT GTG TAC GAT AAT T-XV 
pre Che vai asr. ciu glu ile lys vai gin phe aia lys asn phe val cyr asp asn asn 

661/221 591/231 

Mtino « Oi** r^mrm — • — . — ^ ^^'^ — "H ~ — • « ^^9. ."Li 

i •••O ^.-tn. . ... JAi -wvfk . -JV. -. -*» -.A . VrtA 

ser ile leu arg val pre glu ser phe his asp pro asn arg phe glu gin ser leu glu 



'21/241 751/251 

FIG. 6A 
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^- \:-wn .-.\jr» r.i.-- c--rt ew. - J\J ^-rrv * ^s>A ff..w .-wi ^.-^ - .U cAA 

val 5 la zrc aC3 'I^li rlu ale trp chc lift cvr lie 9iy :.le Ivs clu lea cae asp 
"31/261 311/271 

jrui -c.-w^ Vs.. -jtvj v-.v, .--j^.l ... G^•rt nlT v?ni .nAA V-i.n iiU ..-iC rtni cvn rtTG 

rly ::iu cro vai leu isn che aia ile vai sso Ivs leu one zvr asn als oro l/s .T.ec 
=41/291 371/291 

w.vj '^rii ..-.1 w.* v-A rii 1 OiL. -jnL. \,rwu iCvj .--nU ^.-vA Vj^rt V,or\ 

ser lau leu asD ivr lea le'j l2u lie vai aso cro cln ser cys asn eso aso vai arc 
501/301 931/311 

AAA GAT CTT AAA ACA AAA CTS ATG GCG GGA AAA ATG ACA ATC AGA CAA GCC GCG CGG CCA 

1/5 osp leu lys thr lys leu net ala qly lys met thr ile atg gin ala aia arg pro 

?6i/321 991/331 

AGA ATT CGA CAA TTA TTG GAA AAT TTG AAG CTG AAA TGC CCA GAA GTT ZOG GAT AAC GAA 
arq ile acq gin leu leu glu asn leu lys leu lys cys aia giu vai trp asp asn glu 

1021/341 10S1/3S1 

ATG TCG AGA TTG ACA GAA CGA CAT CTG ACA TTT CTA GAT TTG TGC GAG GAA i^AC TCT CTT 
r.ec ser arc lau thr riu arc his leu zhz phe ieu asp leu cys giu giu asn ser leu 

1081/361 1111/371 

GTT TAT AAA GTC ACT GGT AAA TCG GAC AGA GGA AGA AAT GCA AAA AAG TAG GAT ACT ACA 
vai tyr lys vai chr giy iys ser asp arg giy arg asn ala lys lys cyr asp thr thr 

1141/38X 1171/391 

TTG TTC AAA ATC TAT GAG GAA AAC AAA AAG TTC ATT GAG TTT CCC CAC CTA CCA CTA GTC 
leu phe lys ile tyr glu giu asn iys iys phe ile giu phe pro his ieu pro leu vai 

1201/401 1231/411 

AAA GTT AAA ACT GGA GCA ?M GAA TAG GCT GTA CCA ATG GAA CAT CTT GAA GTT CAT GAG 
1/5 vai iys ser giy aia lys glu cyr ala vai pro net glu his ieu giu vai his glu 

12Sl/s21 1291/431 

.rjr.\j N-.-vft .-.vjH ..-.v ---nu f-nt. w^n . i. i. trrn - j Liiu j*. avj «-/irt ruw - • . .nMVj v^vijrt 

IVS pro 2ln srq Tvr lys asn arg ile ssp leu vai -.et -^in asp lys phe leu iys arg 
1321/441 1251/451 

■3CT ACA CGA AAA CCT CAC GAC TAC AAA GAA AAT ACC CTA AAA ATG CTG AAA GAA TTG GAT 
sla thr arg lys pro his asp cyr l/s ciu asn chr ieu lys niet leu lys clu leu asp 

1231/461 1411/471 

TTC TCT TCT GAA GAG CTA AAT TTT GTT GAA AGA TTT GGA TTA TGC TCC AAA CTT CAG ATG 
phe ser ser glu glu leu asn che vai giu arg phe giy leu cys ser lys leu gin met 

1441/481 1471/491 

ATC GAA TGT CCA GGA AAG GTT TTG AAA GAG CCA ATG CTT GTG AAT AGT GTA MiT GAA CAA 
ile giu cys pro giy lys vai leu lys giu pro met leu vai asn ser vai asn giu gin 

lEOl/SOl 1331/511 

AIT AAA ATG ACA CCA 37G AIT CGT GGA TTT CAA GAA PJ^A CAA TTG AAT GTG GTT CCC GAA 
lie Ivs -et thr crs vai lie arc sly che clr. aiu iys cin leu asn vai vai pro giu 
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lys giu lea cys cys 5i£ v*l she vai vai asn ciu cr.r aia ciy asr. pre cvs leu clu 



1621/541 1551/551 

'oii .--r.vj .-.v.-*. vsru-i r;t i joi - .--nvs * . «- •jnj/* v-.iA 

jiu asn asp vai vai 1/s cr,e ^yr cr.r giu leu lie ciy qly cys lys cae arq giy lie 



1681/561 

rti i %^\Ji rvni 

2rg ile ciy aia asn 



4,^^ ^ jttm ^ 

or>/v .-tvjn Vj'^n '>^v««j 

glu ssn arg ciy aia 



. / 1 / 3 / i 

i-iib r>-lo ifv^ 

cir. ser ile mec ryr 



^jrtC v^CCi ACG AAA AAT 
asp aia ^hr lys asn 



:741/58x 

NSJvi lAT i.v^ I AC nAA .r^'.i .%jT .-.uA ...n 

ciu tyr aia phe tyr iys asn cys thr leu 



1771/591 

AAT ACC GGA ATC GOT AGA TTT GAA ATA GCC 
asn rhr giy ile gly arg she ciu ile aia 



1301/601 

GCA ACA GAA GCG AAG AAT ATG TTT GAA CGT 
aia chr giu aia iys asn .T.ec che giu arg 

1961/621 

:TC ATT ATC ATT TCC .-AA Z'k CAA CTG .-AT 
Che ile ile ile ser lys arc ;in leu asn 



1831/611 

CTT CCC GAT AAA GAA CAA AAA GTC TTA ATG 
leu pro asp iys giu gin iys vai leu met 

1891/631 

GCT TAC GGT TTT GTG AAA CAT TAT TGC GAT 
aia cyr giy phe vai iys his tyr cys asp 



1921/641 

CAC ACC ATC GGT G7A GCT ,-AT CAG CAT ATT 

his thr ile giy vai aia asn gin his ile 

1981/661 

TCA CTA AGG CAC GAG AAA GC-A TCA AAA CGA 

ser leu arg his ciu lys ciy ser lys arg 



1951/651 

ACT TCT GAA ACA GTC ACA AAA GCT TTG GCA 
thr ser giu thr vai thr iys aia leu aia 

2011/671 

ATT TTC TAT CAA ATT GCA TTG AAA ATC AAC 
ile phe tyr gin ile aia ieu iys ile asn 



2041/631 

GCG AAA TTA GGA GGT ATT AAC CAG GAG CTT 

sia iys Isu ciy ciy ila asn gin ciu leu 

11 Ji/70i 

jr'-rv vArt \;run r,\it\ .-.-lt- a . ^ v, w -i 

^jlVl avjd C«>^ SlkiM • •••• m, 



2071/691 

GAC TGG TCA GAA ATT C-CA GAA ATA TCA CCA 
asp trp ser glu lie aia giu ile ser pro 

2 131/ "^11 

. ,.n !\\m% i.-ii .-1 i i ^.-.4. ns* I 

leu thr r.et 'yr vai riy -ie asp vai ihr 



1161/721 

;M ^ Ift f^f^^ Mtt^ • r» ^ " ^•W^ ^•W* 

..nT v-w k.'^W .-.ma r. ii s^rti .ni 

r.iS pro t^r ser cyr ser ciy ile asp tyr 



2191/731 

TCT ATA GCG GCT GTA GTA GCG AGT ATC AAT 
ser ile aia aia vai vai aia ser ile asn 



2221/741 2251/751 

:TA GGT GGA ACT ATC TAT C:A AAT ATG ATT GTG ACT CAA GAA GAA TGT CGT CCC GGT CAG 
pro giy giy thr ile tyr arc asn niet ile vai thr gin glu glu cys arg pro giy glu 



1281/761 2211/771 

.UA Ls\. ^ /^vjrt oni. ii > .vj V:rtr\ txnXa . ruvV «. 

=rg aia vai sia his ciy src ciu arc thr asp lie leu giu aia lys she vai lys leu 



:341/731 
rTC AGA wAA 



' ^ r ^ 



.eu arc c^u cr.e aia ciu a 



sn asn 



371/791 



GTA G 4 C TAT 



^ OA 



rsAi V?^/\ 

;So asn arc aia cro aia his ile vai vai tyr arg 
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:401/=0i 2431/311 

:-iC 3GA C-T? AGC C-A? ?CC- AIG CTA CC-T C-TT AGT Cn? OAT GAG CTt C3A TCT TTA AAA 
ssp giy vai ser asp set riu .T.et Isu srg vai sar his asp giu leu arg ser lau lya 

2-161/321 2491/331 

AGC GAA GTA AAA CA* TTC ATG TCG GAA CGG GAT G-SA GAA GAT CCA GAG CCG AAC 7AC ACG 
s&r clM vai iys cin phe .T.ec ser glu arg asp cly giu asp pro giu oro iys tyr cJic 
2521/841 2551/351 

CTC ATT G7G ATT CAG AAA AGA CAC AAT ACA CGA TTG CTT CGA AGA ATG GAA AAA GAT AAG 
phe lie vai ile gin lys arg his &sn z'r.z &rq Isu leu arg arg met giu iys asp lys 

2591/861 2611/371 

CCA GTG GTC AAT AAA GAT CTT ACT CCT GCT GAA ACA GAT GTC GCT GTT GCT GCT GTT AAA 
pro vai vai asn lys asp leu chr pro aia giu chr asp vai ala vai ala ala vai lys 

26U/881 2671/891 

CAA TGG GAG GAG GAT ATG AAA GAA AGC AAA GAA ACT GGA ATT GTG AAC CCA TCA TCC GGA 
gin trp giu giu asp sier lys giu ser lys giu ttit giy lie vai asn pco ser ser gly 

2701/901 2731/911 

ACA ACT GTG GAT AAA CTT ATC GTT TCG AAA TAC AAA TTC GAT TTT TTC TTG GCA TCT CAT 
cnr thr vai asp Iys leu ile vai ser lys cyr lys phe asp phe phe leu ala ser his 

2761/921 2791/931 

CAT GGT GTC CTT GGT ACA TCT CCT CCA GGA CAT TAC ACT GTT ATG TAT GAC GAT AAA GGA 
his ^ly vai leu gly chr ser arg pro gly his cyr chr vai met tyr asp asp lys gly 

2S21/941 2851/9S1 

ATG AGC CAA GAT GAA GTC TAT AAA ATG ACC TAC GGA CTT GCT TTT CTC TCT GCT AGA TGT 
niec ser gin asp giu vai cyr lys met chr tyr gly leu ala phe leu ser ala arg cys 

2881/961 2911/971 

CGA PAA CCC ATC TCG TTG CCT GTT CCG GTT CAT TAT GCT CAT TTA TCA TGT GAA AAA GCG 
srg lys pro ile ser leu pro vai pro vai his cyr aia his leu ser cys giu, lys ala 

:?4L/991 2971/991 

.-AA C-AG CT7 TAT C::^A ACT TAC AAG GAA CAT TAC ATC GGT C-AC TAT (^CA CAG CCA CGG ACT 
lys clu leu cyr arc z'r.r lyr lys qi'i r.is cyr ile giy asp cyr ala gin pro arg chr 

3001/1001 3031/1011 

CGA CAC GAA ATG GAA CAT TTT CTC CAA ACT AAC GTG AAG TAC CCT GGA ATG TCG TTC GCA 
arg his giu met giu his phe ieu gin chr asn vai lys tyr pro giy met ser phe ala 

3061/1021 3091/1031 

T^A CAT TTT GCA AAA GTG TCG CCC GTT TCA ATC AAA TTT TTC AAT TGT AGA TAT TGT ACT 

cCK {SEQIDN0:3) 

3121/1041 3151/1051 

TAC TTT TTT TTA AAG CCC GGT TTC AAA AAT TCA TTC CAT GAC TAA CGT TTT CAT AAA ITA 

3131/1061 

CTT GAA ATT TAA AAA AAA AAA AAA AAA (SEQ ID N0:2) 
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10 20 30 40 50 60 

MDLTKLTFESVFGGSDVPMK 

70 -80 90 100 UO 120 

CCTTCCCGATCGX3AGGATAACAMACGCXAA^^ 

PSRSEDNKTPRNRTDLEMFL 

130 140 150 160 170 180 

KKTPLMVLEEAAKAVYQKTP 

190 200 210 220 230 240 

ACTTGGGGCACTGiaSAACrTCCTGAAGGCT^^ 

TWGTVELPEGFEMTLILWSl 

250 260 270 280 290 300 

ACTGTAAAAGGCCAGGCAACAAGCAAGAAAGCTGCGAGAC^^ 
TVKGQATSKKAARQKAAVEY 

310 320 330 340 350 360 

TTACGCAAGGTtCnC^GAGAAAOGAAAGCACGAAAT^^ 

LRKVVEKGKHEIFFIPGTTK 

370 380 390 400 410 420 

GAAGAAGCiCUU'iCGAATATTCATCAAATATCGGATAAGGClGAGG^ 
EEALSNIDQISDKAEELKRS 

430 440 450 460 470 480 

ACTTCAGATGCPCOTCAGGATAAay^^ 

TSDAVQDNDNDDSIP TSAEF 

490 500 510 520 530 540 

CCACCTGGTATTiaxrCAACCGAGAAlTCGGaXXXSAAAG^ 

PPGISPTENWVGKLQEKSQK 

550 560 570 580 590 600 

AGCAAGCTGCAAGCCCCAATCTATGAAGATTCCAAGAATGAl^ 
SKL .QAPX YEDSKNERTERFL 

610 620 630 640 650 660 

GTTATATGCACGATGTGCAATCAAAAAACCAGAGGAATC^^ 
VICTMCNQKTRGIRSKKKDA 

670 680 690 700 710 720 

AAGAATCTIGCAGCATGGTIGATGTGGAAAGCGTIG^ 

KNLAAWLMWKALEDGIESLE 

730 740 750 760 770 780 

TCATATGATATGGTTCAaXSTCATIGAA^ 

SYDMVDVIENtiEEAEHLLEI 
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790 800 810 820 830 840 

CAGGATCAAGCATCCAACaATTaAAGACAAGCATO 

QDQASKIKDKHSALIDILSD 

850 860 870 880 890 900 

AAGAAAAGATirrcAGACTACAGCATGGAlTIt^ 

KKRFSDYSHDFNVLSVSTMG 

910 920 930 940 950 960 

ATACATCaGGIXXrtATIGGAAATCTCGrTC 

IHOVt.LEISFRRLVSPDPDD 

970 980 990 1000 1010 1020 

TTGGAAATGGGAGCAGAACACACCCAGACTGAAGAAAOT^ 

LEMGAEHTQTEEIMKATAEK 

1030 1040 1050 1060 1070 1080 

GAAAAGCTAaxaAGAAGAATATGCCAGATlCCGGG^^ 

EKLRKKNMPDSGPLVFAGKG 



1090 1100 1110 1120 1130 1140 

TCATCGGCGGAAGAGGCTAAACAGTGTGCTTGl^^ 

SSAEEAKQCACKSAIIHFNT 

1150 1160 1170 1180 1190 1200 

TATGATITCACGGATTGAAAATATTATIGCGTATTCCr^^ 
YDFTD*KYyCVFLKNEASE* 



1210 1220 1230 

TmTAAAAAAAAAAAAAAAAAA (SEQ ID N0:4) 

L * ^ K K K K (SEQIDN0:5) 
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